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PREFACE TO THE FIRST EDITION 


Statistics today is an indispensable part of every human activity. The subject, therefore, 
forms part of the syllabus of degree and post-degree as well as several professional and 
competitive examinations. Entire text is designed to eater to the syllabi of indian universities. 


The purpose of this book is to treat statistics as a self-contained mathematical subject 
rigorously, avoiding non-mathematical concepts. The book assumes no previous knowledge 
of the subject on the part of the reader and aims at complete clarity for the beginner and such 
simplicity of exposition as will make the text practically self-teaching. 


Although the textual material is concise and to the point, attention has been paid to the 
development of the underlying concepts. A serious attempt has been made to unify methods. 


The subject is presented in a modulated and graded manner beginning with a fundamental 
core of introductory material which develops gradually from the simple and the easy to the 
complex and the intricate. 


The examples are accompanied by problems. Some of them are simple exercises but most 
of them serve as additional illustrative material to the text or contain various complements. 
One purpose of the examples and problems is to develop the reader’s intuition. Several 
previously treated examples show that apparently difficult problems may become almost trite 
once they are formulated in a natural way and put into proper context. To enhance further the 
utility of the book, thought provoking questions, carefully selected and systematically 
arranged are added at the end of each chapter. Problems drawn from latest examinations of 
various universities are an added attraction of the book. 


It is hoped that book will prove to be of much utiliy to the students as well as teachers __ 
of the subject. . . 
The author would like to express his appreciation to Mr. V. S. Prasad for his useful 


suggestions. Author’s thanks are also for Miss. Neeru Kapoor for her carefully reading the 
material and help. 


I take this opportunity to thank the various well-known authorities of the world from 
whom I have drawn an inspiration. I am also grateful to all my colleagues in the Deptt. of 
Mathematics for the kind help they have given in the preparation of this book. 


The author is also idebled to publishers and printers for their co-operation in this 
endeaver. 


Suggestions to further improvement are welcome and will be most gratefully 
acknowledged. 


N.M. KAPOOR 
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Introductory 


0.1. The word statistics and the related words ‘statistical’ and ‘statistician’ have various 
meanings. To a man in the street statistics are only figures and statistician as one who counts 
the number of things. To the economist, statistical stands for the quantitative and to the 
physicist statistical is opposite of individualistic or exact. The word statistics have either 
been derived from Status (Latin word) or Statistica (Italian) or Statistik (German) each of 
which means an “organized political state”. Originally statistics was related to the collection 
of factual details concerning a state and this is why earlier it was known as the “Science of 
State Craft”. : 

The word ‘statistics’ when used in plural, means the numerical data collected in an 
orderly manner with specific end in view but in its singular meaning it means the theoretical 
science with techniques which deals in to collect, analyse and draw conclusions from the 
data. 

Statistics is both a science and an art. It is a science because its methods, like other 
branches of science, are basically systematic.and have general application and is an art in 
that their successful application depends to a considerable degree on the skill and experience 
of the statistician. 


0.2. Scope of Statistics 

The applications of statistics are so numerous and ever-increasing that not only it is 
difficult to define its scope but also unwise to do so. These days it is introducing into every 
branch of human knowledge. A student of science, while conducting an experiment, has to 
rely upon the application of statistics. In biology, testing of significance is applied to compare 
the effects of two drugs, the law of probability is used in radiation when the cells in the retina 
of the eye are exposed to light and statistical papers are used to study heart-beats through 
electrocardiogram. : 

Statistics is also used in agriculture and used in this way it is called Agricultural Statistics. 
Various Agricultural problems are solved or simplified by applying some suitable scheme 
for collection, analysis and the interpretation. 

Statistics is an important member of the mathematical family. It is regarded as a branch 
of applied mathematics which specialises in data. Statistical techniques are of great assistance 
in the defence strategies to plan maximum destruction with minimum effect. Also for the 
day-to-day functioning Government depends upon statistics. In Insurance also statistics is 
used, Calculation of premiums and annuity etc., is wholly a statistical work based on the 
theory of probability and expectation. 


0.3. Distrust of Statistics 
Inspite of the very valuable service that statistics renders, there is some amount of 
misgiving in the minds of a few people with regard to its reliability and usefulness. It is said : 


[i 


tii) MATHEMATICAL STATISTICS 


(1) “With statistics anything can be proved”. 
(2) There are three kinds of lies : namely (i) lies (ii) damned lies and (iii) statistics- 
wicked in the order of their meaning. 


0.4. Limitations of Statistics 


Statistics has its own limitations. It cannot be applied to all kinds of phenomena and 
cannot be made to answer all queries. Few of its main limitations are listed below : 

(1) It deals only with those subjects of inquiry which can be measured quantitatively 
and can be expressed numerically. 

(2) It deals only with aggregates of facts and no importance is attached to individual 
items. oO 

(3) Statistical data is only approximately and not mathematically correct. 

(4) Statistics can be used to establish wrong conclusions and therefore can be used only 
by experts. — 

The methods by which statistical data are analyzed are called statistical methods. These 
methods range from the most elementary descriptive devices which may be understood by 
the common man to those complicated mathematical procedures which can be apprehended 
only by the expert theoreticians. The mathematical theory which is the basis of these methods 
is called the theory of statistics or mathematical statistics. The purpose of this text is to 
discuss the fundamental principles and theory of statistics in simple and easily comprehensible 
manner. 
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Frequency Distribution and Measures of 
Central Tendency 


1.1. Introduction 


By a variable is meant a quantity which assumes different values. A variable may be 
continuous or discrete. Ifa variable can take any numerical value within a certain range it is 
called a continuous variable. A variable, for which there is a gap between its two successive 
values, is called discrete. Heights, weights are examples of continuous variable and marks, 
test scores, efc., are discrete variables. , 

In this and next few Chapters only discrete variables will be considered although the 
results obtained will also be true for continuous variables. 


1.2. Frequency Distribution 

A frequency distribution is one where the values of the variable and the number of 
times each value is taken are put together. The number of times the value is taken is 
called the frequency of that value. The sum-of all the frequencies is called total 
frequency. 

When the values of the variable are presented in the form of groups, the 
representation is called grouped frequency distribution. The groups are called classes 
and the boundary figures are called class limits. The figures on the left are called lower 
limits and those on the right are called upper limits. The difference between the two 
limits is called class-interval or width of the class. The frequency of the class is called 
class frequency. The values midway between lower and upper limits are called mid- 
values or central values. 

For computational purposes, it is assumed that variate takes mid-values only. Thus, a 
frequency distribution can be taken in the form 


where x,, x, ...,x, are the values of the variable x with frequencies f,, f,-.. fy 
Let fi tf, t+..+f, HN 
Then, N is total frequency. 


1.3. Graphic Representation of a Frequency Distribution . 


It is one of the most convincing and appealing method of presenting a data. Graphs give 
a bird’s eye-view of entire data and therefore information presented is easily understood. 


_These help one in making quick and accurate comparison of data. 
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A frequency distribution can be presented graphically in any of the following ways : 
(é) Histogram , 

It is a set of adjacent rectangles whose areas are proportional to the frequencies. 

To draw histogram, class-intervals are marked along x-axis on a suitable scale. On the 
class-intervals as bases rectangles are drawn with heights proportional to the ratio of 
corresponding frequency and class-width. For equal class-intervals the heights of the 
rectangles will be proportional to the frequencies. 

The figure below exhibits the histogram of the following frequency distribution, : 

Marks No. of students Marks No. of students 


0-9 9 25-35 30 

9-20 33 35-40 10 

20-25 10 40-50 30 
3 
2 
41 

0 910 20 25 30 35 40 50 
marks 


The x-axis, of course, need not start with zero but the y-axis must start with zero 
and must have no scale break. Sometimes a space is left between the first rectangle and 
the vertical axis. The base of each rectangle is labelled on both sides in terms of the 
class-limits if the data is continuous (i.e., the upper limit of one class and the lower 
limit of the following class coincide). In discontinuous data, only the lower limits are 
marked on x-axis. Some statisticians, however, in presenting discontinuous data, leave 
small gaps between the rectangles and label both limits of each class. Another way of 
labelling the horizontal axis on the histogram is by showing the mid-value in the middle 
of the base of the rectangle. . : 

The histogram sometimes is also used even for representing ungrouped data. Here 
different values are regarded as the mid-points of the classes and corresponding frequencies 
are supposed to spread over the whole class. 

However, for data with open-end classes histogram cannot be constructed. One solution 
to this problem is to plot the histogram without the open-end class or classes and to add the 
information concerning them in figures. 


(ii) Frequency Polygon 


_ Itis used to represent the grouped or ungrouped frequency distribution. For a grouped 
frequency distribution mid-points of the classes are taken as variate values and frequency of 
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the class is taken as frequency of the mid-point. Thus, the data is obtained in the form : 


é Xe ass | 
| fede Is 

* Points (x,, f,), (Xz. fz), ---(X%q» f,) are plotted on the graph paper and successive points 
are joined by straight lines. 

The point (x,, f,) is joined with (x 9,0) and the point (x,, f,) with (x,,,,0) through 
dotted lines. 

The polygon so obtained is called frequency polygon] 

For equal class-intervals, the frequency polygon can be obtained by joining the middle 
points of the upper sides of the adjacent rectangles of the histogram by means of straight 
lines. 

Frequency polygons for two or more frequency distributions can be shown on a single 
graph which is not possible for histograms. Hence, frequency polygons are preferred for the 
graphic comparison of various frequency distributions. 

Thus, for reading the relationship of individual class frequencies to the total, 
histogram presents:a far better picture and as such is preferred than the corresponding 
frequency polygon. 

(iii) Frequency Curve 

If the number of observations goes on increasing, the size of the class-interval can be 
reduced without noting down a fall in the individual class frequencies. The points of the 
frequency polygon will go on becoming nearer horizontally and hence an approximate smooth 
curve can be obtained of passing through most of these points. 

The larger the number of observations would make the class intervals smaller and 
consequently the corresponding frequency polygon would approach more and more closely 
to become a smooth curve. 

This smooth curve is called frequency curve. It is obtained by smoothing either the 
polygon or various tops of the histogram. The curve is drawn freehand in such a manner that 
area under the curve is approximately the same as that of the polygon. The curve should look 
as regular as possible and all sudden turns should be avoided. The curve should begin and 
end at the base line and as a general rule it may be extended to the mid-points of the classes 
just outside the histogram. __ 

The area under the curve should represent the total frequency. 


(iv) Cumulative Frequency Curve or Ogive 

Here cumulative frequencies are used and not the class frequencies. Cumulative 
frequencies are obtained by adding frequencies either from top to bottom or from bottom to 
top. Points are then plotted with upper (or lower) limits of the classes as abscissae and 
corresponding cumulative frequency as ordinates and successive points are then joined by’ 
straight lines. ; 

The graph so obtained is called Cumulative frequency polygon, Cumulative frequency 
Curve is obtained by approximating the polygon through the smooth curve. 

The cumulative frequency curve is also called ogive. 

The cumulative frequencies taken from top (bottom) to bottom (top) are called less 
(more) than type. These are plotted against upper (lower) class limits. Polygon or curve so 
obtained is called less (more) than type cumulative frequency polygon or curve. Starting 
(finishing) point of this curve would be the origin (point on x-axis with upper limit of last 
class as abscissa). 
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These curves are frequently used in estimating various partition values e.g., if a line 


N 
corresponding to a is drawn parallel to x-axis, first quartile will be obtained. Similarly, the 


other partition values can be obtained. 

If instead of cumulative frequencies percentage cumulative frequencies (i.e., cumulative 
frequency expressed as percentage of the total frequency) are taken, curve obtained is called 
percentage cumulative frequency curve. This is-useful in comparing different frequency 
distributions. 

Remark. Some special types of curves which generally we come across in statistics 
are : 

() Symmetrical curves such curves can 
be folded along a vertical line so that two 
halves of the curve coincide. Such curves arise 
for a distribution in which class-frequencies 
go on decreasing symmetrically on either side 


of central value. Symmetrical Curve 


(ii) Moderately asymmetrical or skewed curves 

Curves which are not symmetrical are 
called skewed. Such curves arise for a 
distribution in which frequencies decrease with 
significant rapidity on one side of the 
maximum than on the other. 


(iii) Extremely asymmetrical (or J-shaped) curves 
Such curves are of two shapes : J-shaped Curve 
J-shape and t-shape (reverse J-shape). 
These curves occur for distribution in 

which frequency is maximum at one end of 

value and decrease to minimum to other end. 


(iv) U-shaped curves t-shaped Curve 
These curves are of shape U and occur 


for a distribution in which frequencies increase 
as we move from centre towards ends of y 


values. 


1.4, Measures of location or central tendency eehaped Clas 


These are statistical constants which give an idea about the concentration of the values 
in the central part of the distribution. It can be thought of as the value of the variable which 
is representative of the entire distribution. The following are the various measures of central 
tendency. 


(4 Arithmetic Mean. It is defined by 


za mhit mh + sacccees +x,f,, 
Sit fy tev +f, 


To obtain it for a given data, calculations are simplified by taking the variate 
defined by 


FREQUENCY DISTRIBUTION AND M 
plana a Oe 


where ‘a’ and ‘h’ are to be chose. 


where u is A.M. of u. 
Ex. 1-1. Show that the algel 
arithmetic mean is zero. 


Sol. Let x,,%2 ....,%, be the 


Let N=f\+ 

Let x be A.) 
n 

Now ic: 
i=l 


Ex, 1-2. Show that the arith 


Sol. A 


Ex. 1-3. From the data givei 
SN. 


CoOIdauNnhRwne 


Sol. (i) Direct Method. 
Arithmetic Mean (A.M.) =- 


17+324+35+33+15+2) 


gees = 23-94. 
18 
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| partition values e.g., if a line where ‘a’ and ‘h’ are to be chosen suitably. Then A.M. is given by 
| x=athu 
will be obtained. Similarly, the where i is A.M. of u. 
Ex. 1-1. Show that the algebraic sum of the deviations of a set of values from their 


ve frequencies (i.e., cumulative arithmetic mean is zero. 


taken, curve obtained is called Sel. Let x), x2 ....,x, be the set of values with frequencies f,, fy ... fy 
‘omparing different fr 
se cia Let Nzefithfyt + fy 


y we come across in statistics iw 
Let x be A.M. Then X=— ) fix; 
N > hai 


i=] 


Now Yfli-0= Sin 3A 
i=! i=l i=! 


‘mmetrical Curve = Nx -XN 
=0 
| ; : n+l 
Ex. 1-2. Show that the arithmetic mean of the first n natural numbers is ae 
1+2+...+ 
Sol. AM,=———"* 
n 
J-shaped Curve _n(n+l)_ntl 
2n 2 
Ex. 1-3. From the data given below, calculate mean : 
S.N. Marks S.N. Marks 
. 1 17 10 18 
See 2 32 11 20 
t-shaped Curve 3 35 12 22 
4 33 13 11 
5 15 14 15 
, 6 21 15 35 
l-shaped Curve 7 @ 16 sae 
8 32 17 38 
concentration of the values 9 11 18 12 
? value of the variable which | Sol. (i) Direct Method. 
> various measures of central Sx 
i Arithmetic Mean (A.M.) = = 


_174324354+33+15+214+414+32+11+18+20+22 +11+15+354+23+38 +12 
7 18 


fied by taking the variate = = = 23-94. 


(ii) Short-cut method 


_ 414 +17 _ 431 


= 23-94. 
1818 
Ex. 1-4. Calculate the mean from the following data : 
Size Frequency Size 
4—8 6 24—28 
* 8—12 10 28—32 
12—16 18 32—36 
16—20 30 36—40 
20—24 15 


> fu 62 
z 2 \ x4=22-—— x 4 
am.=22+ [=] « i09 


= 19-725. 
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Frequency 
12 
10 


FREQUENCY DISTRIBUTION AND 


Ex. 1-5. Calculate the mea 


Income between 
i (in Rs.) 


100—200 
100—300 
100—400 


The data is given in the f 

mean, it is to be converted into 

.Now the number of person: 

persons having income betweer 

. The number of persons | 

In a similar manner the f 
calculated. 


Income | Given Freq. (c.f) 


100—200 


200—300 33 
300—400 63 
400—500 83 


500—600 


6 
= + —— 100 
, A.M. = 350 100 


= 356. 
Ex. 1-6. Calculate the mea 


Marks No. 0 
More than 0 
More than 10 
More than 20 
More than 30 


Sol. The data is given in cu 
first converted into ordinary fre: 
Now the number of student 


and the number of students getti 
“. The number of students ; 


Proceeding likewise the fr 
obtained. 

In the end since there is no s 
will be 60—70. 

Thus, we have the table : 
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pera cree inreeb ican fecal 


94, 


Frequency 
12 
10 


Income between Nooroenons Income between 
(in Rs.) P (in Rs.) 
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100—200 100—500 83 


100—360 100—600 100 
100—400 


Ex. 1-5. Calculate the mean from the oe data : 


The data is given in the form of cumulative frequency distribution. For calculating 
mean, it is to be converted into an ordinary frequency distribution. 
Now the number of persons having income between 100—200 = 15 and the number of 
persons having income between 100—300 = 33. 
. The number of persons having income between 200—300 is 33 — 15 = 18. 
In a similar manner the frequencies of groups 300—400, 400—-500 erc., can be 
calculated. 


Income 


(= ae) 
Given Freq. (c.f) | Frequency (f) | (x) Mid pt. | ¥= 


He 15 = 18 
63-33 = 30 
83-63 = 20 
100-83 = 17 


100—200 
200-—300 
300—400 
400-—500 
500—600 


“ A.M. = 350 + ae 100 


100 
= 356. 
Ex. 1-6. Calculate the mean for the data given below : 
Marks No. of students Marks No. of students 
More than 0 100 More than 40 25 
More than 10 90 More than 50 15 
More than 20 75 More than 60 5 
More than 30 * 50 More than 70 ~ 0 


Sol. The data is given in cumulative distribution type. For calculating mean it is to be 
first converted into ordinary frequency distribution. 
Now the number of students getting marks more than 
‘0’ = 100 
and the number of students getting marks more than 
‘10’ = 90 
.. The number of students getting marks between 
0—10 = 100 —- 90 = 10. 
Proceeding likewise the frequencies of other classes 10—20, 20—30 efc., can be 
obtained. 
In the end since there is no student getting marks more than 70, the last class in the table 
will be 60—70. 
Thus, we have the table : 
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Note. Classes below are closed from right. 


Sol. 


100 — 90 = 10 -3 -30 - 
90 —- 75 = 15 


= 31. 
Ex. 1-7. The following table gives the frequency distribution of monthly salaries of 70 
employees of company X. 


Salary (in Rs.) No. of Employees 
100—119 8 
120—139 10 
140—159 14606 o™N 
160—179 15 
180—199 10 
200—239 8 
240—259 _3 

70 


Gonupute the arithmetic mean. 

Sol. The class intervals are given in inclusive forms. A value less than 119-5 will be 
counted in first class and a value greater than or equal to it in the second, so for calculation 
the class 100—119 is as good as 99-S—119-5 and so on. Thus, we have the following table : 


99-5—119-5 
119-5—139-5 
139-S—159-5 


159-5—179-5 
179-5—199-5 
199-5—239.5 
239:5—259°5 


36 
M. = 169-:5-— x10 
A.M 70 
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Ex. 1-8. Ilustrate by an exc 
(i) adding ‘a’ to every 

(ii) subtracting ‘a’ fron 
(iii) Multiplying every ii 
(iv) Dividing every item 
on the arithmetic mean of a 


Sol. Let the series be 


A 


(i) By adding ‘a’ to every it 
2+ 


A 


so that arithmetic mean also inci 
(ii) By subtracting ‘a’ from 
2 


so that arithmetic mean also dirr 
(iii) By multiplying each ite 


so that arithmetic mean is also n 
(iv) By dividing each item t 


so that arithmetic mean is also d 
Ex. 1-9. [f m,, m, be the ari 
Find the A.M. of the series obta 
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ition of monthly salaries of 70 


Employees 


. Value less than 119-5 will be 
i the second, so for calculation 
3, we have the following table : 
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36 
= 169:5-— 
7 


= 169-5-5-143 = 164-357. 
Ex. 1-8. //Justrate by an example the effect of 
(i) adding ‘a’ to every item. 
(ii) subtracting ‘a’ from every item. 
(iti) Multiplying every item by ‘a’. 
(iv) Dividing every item by ‘a’ 
on the arithmetic mean of a series. 


Sol. Let the series be 
2, 3, 4, 5, 6 
24+3+44+5+6 20 
A.M. = ee ee 


(i) By adding ‘a’ to every item, the new series is 
2+a, 3+a,4+a, 5+a, 6+a 
es (2+a)+(3+a)+(4+a) +(S+a)+(6+a) 
5 
=4+a 


so that arithmetic mean also increases by ‘a’. 
(ii) By subtracting ‘a’ from each item, the new series is 


2-a,3-a,4-a,5-a,6-a 


(2—a) +(3-a)+(4-—a) +(5-a)+(6-a) 


A.M. = 
m 5 


=4-a 
so that arithmetic: mean also diminishes by ‘a’. 
(iii) By multiplying each item by ‘a’, the new series is 
2a, 3a, 4a, Sa, 6a 


€ 2a+3a+4a+5a+6a 


A.M. 5 


= 4a 
so that arithmetic mean is also multiplied by ‘a’. 
(iv) By dividing each item by ‘a’, the new series is 


Lf 233" A. Si. 6 
AMoe Slot tae tS 


so that arithmetic mean is also divided by ‘a’. 
Ex. 1-9. /f m,, m, be the arithmetic means for two series of sizes n, and n3 respectively. 


Find the A.M. of the series obtained on combining them. 
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Sol. Let ¥1,%22++-%,, and Yi» V2 ++ Yn, be the items of two series respectively. Then 


1 
m, = —(x +X +o. tq) 
ny 


yice mcr bye i) 


Let m be the A.M. of the combined series. 


1 
ny +N, 


Then m= {(xi + xg tity) +(Yp +2 te tI )} 


Ses! {mn, + m,n, }- 
ny +n, 
Ex. 1-10. The mean of the marks obtained in an examination by a group of 100 students ' 
was found to be 49:46. The mean of the marks obtained in the same examination by another 
group of 200 students was 52:32. Find the mean of the marks obtained by both the groups of 
students taken together. 


Sol. Here m, =49-46, —-n, = 100 ‘ 
my = 52-32, ny = 200 
4946 + 10464 
m* 100 + 200 
_ 15410 
300 
= 51:37, 


Ex. 1-11. Zwo groups of students reported mean weights of 162 and 148 pounds 
respectively. When would the mean weights of both groups together be 155 pounds? 


Sol. Here m = 155, m, = 162, m, =148. Let n, and ny be the sizes of two groups. 


nym, +mn 
Tidal 2n2 
Then, | i Cal 


1s5=[ a Jisa( m2 ) oss 


-| nm asi al ) oe 
ny tng ny +n 


. (| (162 — 148) +148 
ny +N, 


155-148 =7 


if La 
- 
Le aa a 
= 
+ |S 
3 
nN 
ae” 
ll 
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or 


. Two groups must be of 
Ex. 1-12. The mean annu 
The mean annual salaries paic 
respectively. Determine the pe. 


Sol. Here m = 5000, my, : 
and female employees. 


’ Now, 


or 


ny 8 
or =-- 
ntn, 10 
fey 8 
ny +n ny + 


-, no. of male employee 

and _ no: of female em 

Ex. 1-13. The populatio. 
25,000 and 24,000 respective. 
was 280, 270, 240, 230 and 3 
Find out the average inc. 


Sol. Let 1, 12,3, 145M: 

the average income of the res 
n, = 20,0! 

and my, = 280, 
Let m be the average inc 


Then 
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wo series respectively. Then 


i) 
n) 


AX_,) + (9) + 2b tng )} 


ate 


tion by a group of 100 students 
+ same examination by another 
obtained by both the groups of 


ights of 162 and 148 pounds 
together be 155 pounds? 


be the sizes of two groups. 


mn (148) 


pry 


ape ) cs 


ny + Ny 


48) + 148 
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2ny =n +Ny 


or ny = Ny 

”. Two groups must be of same size. 

Fx. 1-12. The mean annual salary paid to all employees of a company was Rs. 5000. 
The mean annual salaries paid to male and female employees were Rs. 5200 and Rs. 4200 
respectively. Determine the percentage of males and females employed by the company. 

Sol. Here m= 5000, m, = 5200, m, = 4200. Let n, and n, be the number of male 
and female employees. 


nym + NyM, 


’ Now, m= e 
ny + Ny 
ny ny 
5000 = |—— (5200) + (4200) 
ny +n, ny +N 
or so~( “A J oms(1- “4 ) 

ny tng n, +n 

ny 
= | —1— | (52-42) + 42 
ny +My 


ny 8 
or =— 
ntn, 10 
ny =|- ny =1 8 = 2 
ny +N ny +N 10 10 
+, no. of male employees = 80% 


and _ no: of female employees = 20%. 

Ex. 1-13. The population of five towns A, B, C, D, E was 20,000; 26,000; 23,000; 
25,000 and 24,000 respectively. The average income of the resident for the respective towns 
was 280, 270, 240, 230 and 300. 

Find out the average income per head for all the towns combined. 


Sol. Let Ny, Nz,Nz,M4,Ns5 be the population of five towns and my, mz, 13, M4, IMs be 
the average income of the resident for the respective towns. Then 
n, = 20,000, n, = 26,000, n; = 23,000, ng = 25,000, n, = 24,000 
and m, = 280, m, = 270, m; = 240, m4 = 230, m, = 300 
Let m be the average income per head for all the towns combined. 


mn + M,N, + m3N3 + myn, + M5Ns5 
ied Cad Lith el “enna r- oaoens SOE 


Then m= 
ny + Ny +N3 +g + Ns 


5600 + 7020 + 5520 +5750 + 7200 
20+26423+25+24 
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=e 263-47. 
“118 


Ex. 1-14. What is the average of daily wages for the workers of the two factories 
combined : 


Factory A Factory B 
No. of wage earners 250 200 
Average daily wage Rs. 2:00 Rs. 2:50 


Sol. Let m be the average of daily wages for the workers of the two factories 
combined. 


(250) (2) + (200) (2-5) 


Then, m= 250 + 200 
_ 50+50 100 
45. 45 

= Rs, 2:22. 


Ex. 1-15. The mean marks of 100 students were found to be 40. Later on it was discovered 
that a score of 53 was misread as 83. Find the corrected mean corresponding to the corrected 
score. 

Sol. Let x be the variable for the marks. 


Then, x =40 
and n(= size) = 100 
Lu Xx =40 
n 
= 40x 100 = 4000 
Corrected value of Xx = 4000 - 83+ 53 
= 3970 
70 
Corrected mean = gen = 39-7 


100 


(ii) Geometric and Harmonic Means 
Geometric Mean: /t is defined by 


uN 
G= {x Se na 


were N= fi t+ fot..tf,- 
For a given data it is obtained by fi eel M. of log x and then taking ‘antilog’. 
Harmonic Mean : /t is defined by 


1 1 1 
foemt he mtent heme 
1 xy xX x 


—_= n 


H fit fo toc f, 


where H is the harmonic mean. 


1 
For a given data it is obtained by finding A.M. of cS and then taking its reciprocal. 
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Ex, 1-16. Find the geome. 
Sol. Let x be the variable. 


“. Values of y = logig x 


0, logyy 2,2. 


logio (' 


Ex. 1-17. Calculate Geon 
6°5, 169-0 


Sol. 


1 
logi9 (G.M.) = a 
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orkers of the two factories 
Factory B 
200 


Rs. 2°50 
‘kers of the two factories 


9. Later on it was discovered 
‘responding to the corrected 


! then taking ‘antilog’. 


then taking its reciprocal. 
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Ex. 1-16. Find the geometric mean of the series 1, 2, 4, 8, ....0604 2". 


Sol. Let x be the variable. Then values of x are 


. Values of y = logig x are 


0, logyg 2,2 logig 2, 3 logig 2,...n logyg 2. 
0+ logig 2+2 logig 2+3 logig 2+ 


logjg (G.M.) = (aad) 
1 
= {1 2) —— 414+2+...4 
(logio 2) ree { n} 
1 n(n+)) 
= (log,, 2} —— -——— 
( £10 oie ) 
n 
m 5 [810 2 
= logio ae 
GM. = gna 


eth logig 2 


Ex. 1-17. Calculate Geometric and Harmonic means from the following data : 
6°5, 169-0, 11-0, 112°5, 14-2, 75-5, 35-5, 215-0. 


Sol. 


1 13-0462 
logig (G.M.) = ry (Zlogig x)= ee = 1.630775 
=1-6308 
G.M. = 42°74 
1 
cae Ae by 1] 0-3762 
EHiM: oS 
n 8 
H.M. = 21:27 
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Ex. 1-18. Find the geometric mean and harmonic mean for the data of Ex. 1-4. 


0-778151 0-166667 4:668906 1-006002 


1-000000 0-100000 10:000000 -| 1:000000 
1-:146128 0:071429 20-630304 1-285722 
1-255273 0-055556 37:65819 1-666680 
1-342423 0:045455 20-136345 0681825 
1-414973 0-038462 16:979676 0-461544 
1-477121 0-033333 14-:77121 0-333330 
1-531479 0-029412 9-188874 0-176472 
1-579784 3-159568 0-052632 


137-193073 | 6:°658207 


rf (i 
logig (G.M.) = i — ’ 
sg a ANS ro sges= 1.9587 
109 
GM. = 18:14 
£ 
and aaa (x4) 6.658207 
H.M So 
N 109 
109 
M. = ———— = 16-37. 
HLM. = 6658207 


Ex. 1-19. Jn previous Ex. find quadratic mean. 
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Ex. 1-20. Uf g, and g, b 
the geometric mean of the sei 


Sol. Let %1,%2---X,, and 


Then, 


and 
Let g be the G.M. of the 


Then, 


Ex. 1-21. Uf x), x2...) 
H.M. ‘H’, show that 


Sol. 


and 


From inequalities 


Also aaa 
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alt rsa esa cain a ee 
FREQUENCY DISTRIBUTION AND MEASURES OF CENTRAL TENDENCY 15 


- the data of Ex. 1-4. 


Ex. 1-20. If g, and g, be the geometric means of two series of n, and ny items. Find 


the geometric mean of the series obtained on combining them. 


.(logi9 *) 
Sol. Let X1>*2-+-%n, and Yj, ¥2--Yn, be the items of two series respectively. 


4-668906 1-:006002 

0:000000 -} 1-000000 ln, 

0-630304 | 1:285722 Then, aie {exam f 

37°65819 1:666680 ‘ia: 

0:136345 | 0-681825 and 82 = {y4-¥a0--Iing | : 

6:979676 0-461544 : ; 

477121 0:333330 Let g be the G.M. of the combined series. 

9-188874 0-176472 alse 
3-159568 0052632 Then, g= (x1.x, veseeeee Xn Vy Yarrrreee Yn, Jats 


37:193073 | 6658207 


= [x12 2-%y,) O1-Ya2---In, 


: 1 
= {8i" fig brut 


= 1-2587 


ny ny 
= (9, m+n (g) m+n 
Ex. 1-21. If x,,%...x, be non-zero positive numbers with A.M. ‘A’, GM. ‘G’ and 
H.M. ‘H’, show that 


A2G2H. 
Xy tXq tect X 
Sol. A= eee = (xy .2q000%q) 
| (3 1 | 
+ tt 
1 x) x x 
and 7 2 1 2 . n 
\ From inequalities 


| ‘ 
| A>G 
oe 1 
— + — Ht .. tO 1 | ln 
Also [3 a ee ia 
n Xy XX, 
| 1.1 
| —2—orG2H 
| H G 
| A>G>H 
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n- 


Ex, 1-22. A variate takes values a, ar, ar’....ar"' each with Srequency unity. Show 


a(l—r” act 1=pr™! 
( and G.M. ‘G’is ar ? andthe H.M. ‘H’ is Ria il Prove 


that A.M. ‘A’ is 
nil—r) j—-r" | 


that AH = G’. Prove also that A>G =H. 


_ al+rtr?+..tr™!) _a(l-r") 


Sol. A 
n n(l—r) 
142+.tn-1 
G =(aar.ar’,...,ar"!)"" =ar  " 
n-l 
=ar ? 
fee 
1 1 1 es aes a 
= — jf lte— tee tt =—-: 
H na r r- rr"! na 1-2 
Fe 
Hara =r)" 


1-r" 
0 AH. sar"! =G?. 
(iii) Partition values : These are the values of the variate which divide the total frequency 


into a number of equal parts. Some important partition values are quartiles, deciles, 
percentiles, quintiles, etc. For a grouped dist these are given by 


.N 
age oe 
Q; =L+ ; h i=1,23 


Q, is known as median. 


N 
i478 

ss aaa I f=, 2...9 
kee 

a ee aad k =1,2...99 
jee 

and! quintile = L+;—-—-}-h 1=1,2,3,4 
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where L = Lower limit c 
Width of the « 
f = Frequency of 

c = Cumulative f 
which partiti 

N = Total Frequei 

Ex, 1-23. Find out the 
25, 15, 23, 40, 27, 

Sol. Items arranged in a 


> 
ll 


N 


we 


If n is the number of ite 


+] 


n 
Median = size of (a 


; 9+ 
= size of { 5 


= 25. 
Ex, 1-24. From the dat 
Sol. Given figures arrai 


—” 


Median = value of [ 


value of 9:: 


valet? 
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th frequency unity. Show 


,., antl ~r)yr"! 
is ————,—_. Prove 
l-r 


divide the total frequency 
s are quartiles, deciles, 


FREQUENCY DISTRIBUTION AND MEASURES OF CENTRAL TENDENCY 17 


where L = Lower limit of the class in which partition value lies. 


h Width of the class. 
f = Frequency of the class. 
c Cumulative frequency upto and including the class preceding the class in 
which partition value lies. 
N = Total Frequency. 
Ex. 1-23. Find out the median of the following items : 
25, 15, 23, 40, 27, 25, 23, 25 and 20. 
Sol. Items arranged in ascending order of magnitude : 


15 


If n is the number of items, 


; n+1)" : 
Median = size of > item 


th 
= size of (3) = =| item 


= 25. 
Ex. 1-24. From the data of Ex. 1-3 find out the median and Quartiles. 
Sol. Given figures arranged in ascending order are . 


1 
2 
3 
4 
5 
6 
7 
8 
9 


th 
Median = value of (=) item 


value of 9-5" item 


value of 9" item + value of 10" item 
2 
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= 21422 _ 43 41.5 
2 2 
1841)" 
QO, = value of 7 item 


= value of (4:75)" item 
3 
= value of 4" item + a (value of 5‘ item—value of 4" item) 
3 
= 15+—(15-15)=15 
4 
3 th 
Q; =.. value of sue item 


= value of (14:25)" item 


1 ' 
= value of 14" item + z (value of 15" item—value of 14" item) 


= 33-42 (35-33) 


= 33-5. 
Ex. 1-25. From the data given below, find out the median and the two Quartiles : 
WagesinRs. : 20. 21 22.223 24 25 26 27 28 
No. of workers: 8 10 11 16 20 25 15 9 6 


Sol. 


: Frequency Cumulative 


120+1)" 
Median = value of 2 item 


value of (60:5)" item 


value of 60" item + value of 61" item 
2 
From the above table, there are 45 items upto 23 and 65 items upto 24. 
-. Value of item from 46" to 65" is 24. 
. Value of 60" and 61° items each is 24. 
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24424 _ 


ses dian = 
Median 7 


Q, = value of( 


= value of (2 


= value of 3! 


- 3++¢ 
4 
= 23 


Q, = value of : 


= value of (¢ 


= value of 9 


7 54+20 
4 


= 25-75. 
Ex. 1-26. Find out the m 
following data : 
Monthly Rent ir 
20— 40 
40— 60 
60— 80 
80—100 
100—120 
120—140 
140—160 
160—1i80 
180—200 
Sol. 


Class-interval 


20— 40 
40— 60 
60— 80 
80—100 
100—120 
120-—140 
140—160 
160—180 
180—200 
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of 4" item) 


ie of 14 item) | 


d the two Quartiles : 
5 26 27 28 
5 15 9 6 


nulative 
2quency 


s upto 24. 
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24424 
“. Median = ————— = 24 
2 
h 
12041)" | 
Q, = value of FI item 


= value of (30:25)"" item 

= value of 30" item + (value of 315 item — value of 30" ties) 
= 23+ ; (23 -23) 

= 23 


3- 
Q; = value of 7 (120+1)" item 


= value of (90-75)" item 


3 
= value of 90" item + mA (value of 91% item — value of 90" item) 


= 25 += (26-25) 


25:75. 
Ex. 1-26. Find out the median, quartiles, 3rd quintile, 5th octile, 7th decile from the 
following data : 


Monthly Rent in Rs. No. of families paying the Rent 
20— 40 6 
40— 60 9 
60— 80 11 
80—100 14 
100—120 20 
120—140 15 
140—160 10 
160—180 ~ 8 
180—200 7 


Sol. 


Cumulative 
Frequency 


20— 40 
40— 60 
60— 80 
80—100 
100—120 
120—140 
140—160 
160—180 
180-—200 


ARC arp Ee 519-5071 *) 4 


20 


100 th 
The Median = value of (2) item 


2 


which lies in 100—120. 


Applying interpolation formula, 


120 - 
Median = 100+ 


= 100+10=110 


value of 50 item 


(50 — 40) 


100 \ 
Q, = value of (2) item 


4 


»= value of 25" item 


which lies in (60-80) 


Q, = 60 + 80-60) 
1 


20 
60 + — (10 
iS ) 


200 


{25 — 15} 


COs ms ls2 2 78:2 


3 
Q; = value of 5200)" item 


= value of 75" item 


which lies in 120—140 


140-120 
(OF=190 += 
Q; 15 


(75 — 60) = 140 


3 
34 quintile = value of 3 (100)"" item 


= value of 60" item 


which lies in 100—120 


120-1 
. 3 quintile = 100 + Se (60 — 40) = 120. 
th : 5 th . “en 
5" octile = value of ri (100) item e 
= value of 62:5 
which lies in (120—140) © 
» 5% octile = 120+ “ (62-5 — 60) 


= 12045 (2-5) = 123-3 
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7 
7% Decile = value of rr 
= value of 70 
which lies in 120—140 

140 
-. 7 Decile = 120+ 3 
= 120+ 4 (1 

3 


Ex. 1-27. Find the median j 
Monthly Wages (in Rs.) 50—S 
No. of Workers : 6 

Sol. 


Class Intervals 


ll th 
Median = value of (44) 


= value of 55:5" ite 
which lies in 65—70 
70 — 6: 
30 


65+ 


.. Median 


65+ S (17::! 
. 6 
Ex. 1-28. Find the Median . 


pol Class Intervals 


100—200 
200—300 
300—400 
400—500 
500—600 


th 


Median = value of it 


= value of 50" item 
which lies in 300—400 
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7 
7" Decile = value of Th (100) item 


= value of 70" item 
which lies in 120—140 


“. 7 Decile = 120+ a (70 - 60) 


120 + : (10) = 133-3. 


Ex. 1-27. Find the median for the data : 
Monthly Wages (in Rs.) 50—55 55—60 60—65 65—70 70—75 75—80 80—100 
No. of Workers : 6 10 22 30 16 12 15 


Sol. 
Frequency Cumulative 
Frequency 


Class Intervals 


lir\: 
Median = value of > item . 


= value of 55-5" item 
which lies in 65—70 
70 - 65 
30 


Ml 


.. Median 65 + (55-5 — 38) 


65+ = (17-5) = 67-92. 


Ex. 1-28. Find the Median and Quartiles from the data of Ex. 1-5. 


Frequency 
100—200 15 15 
200—300 18 33 
300—400 63 
400—500 
500—600 


th 


Median = value of item 


= value of 50" item 
which lies in 300-—400 


Uh et i Seale 
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22 
(iv) Mode: /t is that value o; 
«Median = 300+ ae 23) (50 — 33) dist it is given by 
100 Mode = | eat ae 
= 300 + 3 G7) = 356-67 ] 2fn-h-S; 
th ( Mode = Lt i h 
Q, = Value of re item fith 
= Value of 25" item Second Method is used whei 
which lies in 200—300 | where I = Lower limit of th 
300 — 200 | tin = Frequency of m 
Q; = 200 + —————- (25 - 15) i She : 
18 1 quency of th 
100 fy, = Frequency of the 
= 200+ 13 (10) = 255-55 For a moderately skew distr 
; ; Mode = 3 Median — 21 
3 ba ; Ex. 1-30. From the data of t 
Q,= Value of |] (100)| item Sol. Converting the data intc 
Marks F 
= Value of (75)" item 1 " : 
which lies in 400—500 . 2 
500 — 400 : 15 
Q,= 400+ age ee (75 - $) 7 
= 400 + 5(12) = 460 18 
Ex. 1-29. Compute the Median of data in Ex. 1-7. 20 
Sol. 21 
; In the table on next page, tl 


columns (2) and (3), then in three’ 

(10) and in fives in columns (11) 

100—119 column is indicated by putting a s 

120—139 To find out the point of maxi 
table below : 


140—159 
160—179 
180—199 
200—239 
240—259 


2 


= Value of 35" item 


which lies in 160—179. 
A$ class intervals are of inclusive type, the real limits of the 


nea ag ns 8 ances 


70\" 
Median = Value of (= item 


group are 159-5 to 179°5. 


159-5 Ss = 160-83 
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(iv) Mode : It is that value of the variate for which frequency is maximum. For grouped 
dist it is given by 


Mode = L+ = e Fh h (I Method) 
i Mode = L+ fe h (II Method) 
\ fith 
Second Method is used where first fails, 
where L = Lower limit of the class in which mode lies i.e., modal class. 
Sm = Frequency of modal class. 


f, = Frequency of the class preceding the modal class. 


fo = Frequency of the class following the modal class. 


For a moderately skew distribution mode is given by 
( Mode = 3 Median — 2 Mean 


| Ex. 1-30. From the data of Ex. 1-3 find the mode : 
Sol. Converting the data into ordinary Frequency dist : 
Marks Frequency Marks Frequency 

f 11 2 22 1 

12 1 23 1 

15 2 a2, 2 

17 1 33 1 

18 1 35 2 

20 1 38 1 

21 1 4} 1 
In the table on next page, the frequencies in column (1) are first added in two’s in 
Cumulative columns (2) and (3), then in three’s in columns (4), (5), (6), in four’s in columns (7), (8), (9), 


(10) and in fives in columns (11), (12), (13), (14), (15). The maximum frequency in each 
column is indicated by putting a sign ‘/” above the figure. 

To find out the point of maximum concentration the data can be arranged in the shape of 
table below : 


Frequency 


. 


Analysis Table 


EEN EN SEVERE RETESET 


Y lv ilVviv 
Y iv {viv 
Yl |v 
f the group are 159-5 to 1795. J J Raped dew 
v YViv|Y%i4|viv 
Yilviv v 
Yi4iviv 


(Contd.) 


24 


bo 
= 
ce 
3 
2 |. 
Q 
z 
o 
3S 
S 
ml 
3 
=| 
° 
3 
s 
eo 
r= 
= 


Frequency 
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In this table marks are taker 
vertical. Since according to colu 
row (1) the signs ‘W’ are put und 
(2), mode should be either 11 or 
‘J’ are put under 11, 12, etc. 

In this way the whole table i 

Since value 32 occurs the la 

Ex. 1-31. Following is the d 

from a district. Calculate the mo 
Central size 

(ina 

I 

2 

34 

4 

5( 

6( 

7 

Sol. Since central size increz 
of size 10. Hence various class-i 

Model class is 35—45. 


Mo 


which lies outside the class-inten 
“. The formula fails. 
In such cases we use the secc 


Mc 


By this formula, 
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fn An oe eA v 
v 
in An oe Ae 
v V Yio |v iv 
v 
Y |v 
Y |v iv 


| ts | 7 [0 CER EN ER ESRC CCC 


In this table marks are taken along the horizontal and columns (1), (2), efc., along the 
vertical. Since according to column (1), mode should be either 11 or 15 or 32 or 35, in the 
row (1) the signs ‘Y’ are put under 11, 15, 32 and 35. Similarly, since according to column 
(2), mode should be either 11 or 12 or 15 or 17 or 23 or 32 or 33 or 35, in row (2) the sign 
‘/’ are put under 11, 12, etc. 

In this way the whole table is completed. 

Since value 32 occurs the largest number of times, mode is 32. 

Ex. 1-31. Following is the distribution of the size of certain farms selected at random 
from a district. Calculate the mode of the distribution : 


XN NON 
NNN 
NS NSN 


Central size of the farms No. of farms 
(in acre) 
10 8 
20 12 
30 17 
40 29 
50 31 
60 5 
70 3 


Sol. Since central size increases by 10 throughout the data, each class interval must be 
of size 10. Hence various class-intervals are 5—15, 15—-25 etc. (See Table on page 26). 
Model class is 35—45. 
29 ~17 


= 35+————_- (10 
Model | 58-17-31" ) 


= ee ay 
10 


which lies outside the class-interval 35—45. 
«. The formula fails. 
In such cases we use the second formula, i.e., 


fa 
fith 


Mode = L +——=— A 


By this formula, 


Mode = 35+ (10) = 41.46, 


31 
17 +31 
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Ss-PS€ = (001) 


CECE CACC 


07 — 81-09 


+ = opoy *- 
8I—-0£ OE 


O00F—ODE ST SseID [Epo] *- 


Li 
Le 
07 
0s 
- 0€ 
8b . c 
- 8I 
ce 
ST 


009-—00S 
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00€—007 


00¢—001 
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Ex. 1-33. Find the mode from the data of Ex. 1-7 


e 
8 
3 
Ss 
~~ 
ay 


aN 
\o 
—, 
os 
va) 
— 
os 
+ 
~— 
—_~ 
ina) 
—_ 
on 
N 
— 
a) 
— 
os 
XO 
— 
a 
Ww 
~ 
ms 
9 
— 
—~ 
loa) 
— 
—~ 
N 
— 
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.. Modal class is 140—15' 
i.e., 139-5—159-5 


Ex. 1-34. If the mode and 

16 inches and 20-2 inches. Con 

Sol. For a moderately asyr 
mode is 

N 


Ex. 1-35. If the mode andi 
16” and 15-6", what would be | 


Sol. i 
N 


Me 


Ex. 1-36. The dist. x,,x2.. 


X,,X...X, with the same coi 


‘a’ and ‘b’ are constants. Show 
of the first dist by the same trai 


Sol. By given, 
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.. Modal class is 140—159 
ie., 139-5—159-5 


16-10 
. = 139-5 +——_—__ (20 
be * Mode 32-10-15 | ) 
= 139-5 + C0) _ 156.64, 


Ex. 1-34. If the mode and mean of a moderately asymmetrical series are respectively 
16 inches and 20-2 inches. Compute the most probable median. 
Sol. For a moderately asymmetrical series, the relation connecting mean, median and 


mode is 
Mode = Mean - 3 (Mean — Median) . 


= 3 Median — 2 Mean 


1 
Median = 3 (Mode + 2 Mean) 


= 5 (16 + 40-4) 


= == = 18-8 inches. 


Ex. 1-35. If the mode and mean of a moderately asymmetrical series are, respectively 
16” and 15-6", what would be its most probable median. 
Sol. Mean = 15-6” 


Mode = 16” 


~ ] 
Median = 3 (mode + 2 mean) 


tl 


1 
— (16+ 31-2 
3 ) 


—™ CE ae 
7 47-2 
= = — =15-73 
3 
SSN 
Ex. 1-36. The dist. x,,x...x, with frequencies f,, fy... f,, is transformed into the dist. 
> (ona) ioe) : d 
X,,X...X,, with the same corresponding frequencies by the relation X = ax +b where 
‘a’ and ‘b’ are constants. Show that the mean, median and mode are given in terms of those 
of the first dist by the same transformation. 
Sol. By given, X =axt+b 
2 Bf Bg 
rt ton) “ X; = ax, +b 
2 —) r—) 
oO S + 
ham N N 


l< lx 
: a7) — X=—) : +b 
‘ < x wal iN eae 
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ix 1x 
= a-— x +b—)> ; =axt+b 
Neat Neti mr 


where X and x are A.Ms. 

Since for median the value of the variable corresponding to the middle item is to be 
obtained and this middle item remains middle item in the transformed dist, median of the 
transformed dist is given by same transformation. 

Similar argument holds for mode. 

Ex. 1-37. From the following data find the missing frequency : 

No. of Tablets : 4 — 8 — 12— 16—20— 24— 28 — 32— 36— 40 

No. ofpersons cured: 11 13 16 14 — 9 17 6 4 

The average number of tablets to cure fever was 19-9. 

Sol. Let the missing frequency be ‘a’. 


- Now A.M. of tablets for all the persons = 19-9. 
. Total number of tablets for all the persons 


= 19-9 (90 + a) 
= 19-9a+1791 
ate 19-9a + 1791 = 22a + 1772 
or 21a = 19 
or a=9-0. 


Ex. 1-38. The following table gives the marks obtained by 30 students of a class in 
certain paper : 
Digits (Division) of class intervals 


Calculate the mean and median of the series : (a) by using only the total of class- 
intervals, (b) by using the entire data. 

Sol. The data indicates that 4 students get 46 marks, 3 students get 54 marks, 3 students 
get S57 marks and so on. 


Calculation of Mean 
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(xoidde) {yp = 


SOSSE[D JO STR}IO} BuIsE) (v) : URI] 


or Tool = [mf De [aro [ol 


JUapHys YIva 10} SYAVUI [eJO], 


uvay Jo uonENayeD 


sformed dist, median of the 

30 students of a class in 
ing only the total of class- 
nts get 54 marks, 3 students 


. 
, 


b 
to the middle item is to be 
| — 28 — 32 — 36-—40 
6 


ney: 
wy 
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(b) Using entire data. 


Du* 1409 ae gy . 


n 


= 47 (approx.) 
Median 
(a) Using only totals of class intervals. 


30 
Median has a =15 items below it i.e., it lies in 40—49 i.e., 39-5—49-5 (taking real 


limits). 
; 10 
Median = 39-5 + Ci (15 - 10) 
= 39-5 2 = 46-64. 
(b) Using entire data. 
F 30+1 
Median = value of 5 th item 


= value of 15-5th item 


value of 15th item + value of 16th item 
2 

From Cumulative frequencies it is clear that 15th and 16th items lie in 40—49. There 
are 10 items upto 39. : 

So counting in the group 40—49 the various items we see that 

15th item = 46 = 16th item 

ity Median = 46. 

Ex. 1-39. Show that in finding the arithmetic mean of a set of readings on a thermometer, 
it does not matter whether we measure the temperature in centigrade or Fahrenheit degrees, 
but that in finding the G.M. it does matter. 


Sol. Let a set of N thermometric readings in Centigrade degrees be C,, Cp........... Cy 


and the corresponding readings in Fahrenheit degrees be Fj, F........... Fy- 
The relation between Centigrade and Fahrenheit readings is 


F=32+ Z¢ : 
5 
where C corresponds to Centigrade readings and F to Fahrenheit readings 
FL, =32+ =C, r=1,2....N. 


Now the A.M. of the NV readings in Centigrade degrees 


_ GAG tn tly _Feayy 
N 
and the same in Fahrenheit degrees 
Fo+ Fyt...4F = 
= EN SF (say) 


N 
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SS ell 

PON 

a 

~ ON 

= 32 

= Fahrenheit equivaler 


G.M. of the readings i 
= (F 


“| 


.. Fahrenheit equivale 
= 32 


But (1) and (2) are not 
”. Fahrenheit equivale 


G.M. of the Farenheit read: 


.. The given statemen 
Weighted Average. // 


Weighted arithmetic m 


Weighted geometric m 


and Weighted harmonic m 


Ex. 1-40. Show that 1 


weights are equal to the cc 


Sol. Weigl 
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e., 39:5—49-5 (taking real 


lue of 16th item 


tems lie in 40—49. There 


iat 


eadings on a thermometer, 
ide or Fuhrenheit degrees, 


readings 


ay) 


y) 
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F= ~ (F + Fyt...+Fy) 


Il 
| 
— 
CaN 
Ww 
N 
+ 
| 
a9) 
— 
+ 
aa. 
Ww 
NO 
+ 
rf .o 
Q 
N 
Ny. 
+ 
+ 
) aI 
WwW 
NO 
+ 
i) 
i) 
4 
ee 
SY 


3242. Cy + Cz +...4Cy =-324.26 
5 ‘N 5 


= Fahrenheit equivalent of C (A.M. of Centigrade readings). 
G.M. of the readings in Fahrenheit 


= (F,, Fy....Fy) 


UN 
= {(2+3e1](22-3c.}..(a2+2c, } .(1) 


G.M. of the readings in Centigrade 


me CA oer oF) has 


.. Fahrenheit equivalent of the geometric mean of the readings in Centigrade 


9 
= 32+ 2 (CiCaCy ...(2) 


But (1) and (2) are not same. 

. Fahrenheit equivalent of the G.M. of the Centigrade readings is not the same as the 
G.M. of the Farenheit readings. , 

.. The given statement follows. 


Weighted Average. Jf w,.wp....... w,, be the weights of values x,,X....x,, then 


WX, + W2X24....+W Xp 


Weighted arithmetic mean = Wy + Wo tou + Wy 


w, log x, +....+ w, log seh 
WwW) +... FW, 


Weighted geometric mean = Antilog 
and Weighted harmonic mean = Reciprocai Wp t...tw 


Ex. 1-40. Show that the weighted arithmetic mean of first n natural numbers when 


2n+1_ 


weights are equal to the corresponding numbers is 


1:142-24+....4a.n 


Sol. Weighted A.M. = eo eee 
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1243" an? 


= n(n+1) 
2 
_ n(n+1)(2n+]) © 2 
7 6 n(n+1) 
2n+1 ; 


3 
Ex. 1-41. From the following results of two colleges A and B find out which of the two 
is better : 


A B 
Exam. Appeared Passed Appeared Passed 
M.A. 100 90 240 200 
M.Sc. 60 45 200 160 
B.A. 120 75 160 100 
B.Sc. 200 150 200 140 
Total 480 360 800 600 


Sol. Since the number of students appearing for M.A., M.Sc., B.A. and B.Sc. widely 
differ, simple arithmetic average of pass percentages of the college will not give the correct 
idea of pass percentage of a college for all the examinations taken together. So we take the 
weighted average of pass percentages, weights being the number of students appeared for 
each examination. 


For college A, 
Pass percentage for M.A. = 90% 


45 

Pass percentage for M.Sc. = 60 x 100 = 75% 
75 

Pass percentage for B.A. = 10 x 100 = 62:5% 
150 

Pass percentage for B.Sc. = 300 x 100 = 75%. 


For college B, 
200 250 


A. = —— x100=——% 
Pass percentage for M.A 740 3 fo 


160 
Pass percentage for M.Sc. = 300 x 100 = 80% 


100 
Pass percentage for B.A. = 160 x 100 = 62-5% 


140 
Pass percentage for B.Sc. = 300 x 100 = 70% 


ce 
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No. of student: 
appeared 
(weight) w 


“. Pass percentage of col 


Pass percentage of col 


Since pass percentages fc 
Ex. 1-42. Find the weigh 
Group 

Fuod 
Clothing 
Fuel and lis 
House Ren 
Miscellane: 


Sol. 


Food 


Clothing 13 
Fuel and light 14 
House Rent 17 


Miscellaneous 


.. Weighted geometric n 


Ex. 1-43. A train starts 
average speeds of 12, 16, 24 
19-2 k.m. per hour and not 2. 
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' find out which of the two 


B 
: Passed 
200 
160 
100 
140 
600 


»., B.A. and B.Sc. widely 
e will not give the correct 
1 together. So we take the 
of students appeared for 
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ee” 


No. 4+ Students —_—_4__| of students 
(69) appeared es % . Appeared 
Pass % (weight) w (weight) w 


250 


36000 
.. Pass percentage of college A = "as0 = 75% 


60000 
Pass percentage of college B = = aG0° 75% 


Since pass percentages for two colleges A and B are same, none is better than the other. 
Ex. 1-42. Find the weighted geometric mean from the following data : 


Group Index No. dies 
Fuod 125 7 
Clothing 133 5 
Fuel and light 141 4 
House Rent 173 1 
Miscellaneous 182 3 
Sol. 
Weight (¥ 10g, 2 
Food 125 7 2:0969 14-6783 


Clothing 10-6195 


8:5968 


Fuel and light 


" House Rent 2:2380 


Miscellaneous 6°7803 


42-9129 


.. Weighted geometric mean is given by 


42-9129 
log,, G = ———— _ = 2-1457 
10 20 
G =139-8 = 140. 


Ex. 1-43. A train starts from rest and travels successive quarters of a kilometre at 
average speeds of 12, 16, 24, 48 k m per hour. The average speed over the whole k m is 
19-2 k.m. per hour and not 25 k.m. per hour. Explain. 
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Sol. Here average speeds for each quarter of a k.m. are given. To find out the average 
speed over the total distance, first the total time taken by the train is to be calculated by 
dividing distances by average speeds and then the total distance is to be divided by the total 
time. This procedure is equivalent to finding the weighted harmonic mean of average speeds 


weights being the respective distances. 


1 
Here distances travelled in four cases are same each being equal to 4 k.m. So here 


equal weighted or simple harmonic mean is the appropriate method of averaging. 
“. Average speed over the whole mile 


1 
“sh ft tb ab Od 
Jee + . + -_—-_ + . 
412 4 16 4 24 4 48 
_ _ A448) =19-2. 
4+34+2+1 


. Average speed over the whole k.m. is 19:2 k.m.p.h. and not 25 k.m.p.h. which is 


l ‘ 
simply the A.M. or weighted (weights being a each) A.M. of four average speeds 12, 16, 


24 and 48. 
Ex. 1-44. A cyclist covers his first three k.m. at an average speed of 8 k.m.p.h., another 


two km at 3 k.m.p.h. and the last two k m at 3 k.m.p.h. Find his average speed for the entire 


journey. 
Sol. The average speed for the entire journey is the weighted harmonic mean of the 


speeds with distances as weights. 
.. The average speed for the entire journey 


= 34242 ae! 
zgaty.t ae, 
8 3 3 8 3 
Se any 
9+32 41 


Ex. 1-45. Mr. X travels from A to B at an average speed of 30 k.m.p.h. and returns from 
. Bto A atan average speed of 60 k m per hour. Find the average speed of Mr. X for the entire 
trip. : 
Sol. Let x be the distance between A and B 
Then average speed for the entire trip 


2 2x _ (2) (60) ‘ 
page “eel 
30 60 
= 40. 
.. Average speed for the entire trip 
= 40 k.mp.h. 


Ex. 1-46. An aeroplane flies round a square the sides of which measure 100 km each. 
The aeroplane covers at a speed of 100 km per hour the first side, at 200 k.m.p.h. the second 
side, at 300 k.m.p.h. the third side and 400 k.m.p.h. the fourth side. What is the average 
speed of the aeroplane around the square? 
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es 


Sol. Average spe: 


Ex. 1-47. You take a trip which 
of 60 k.m.p.h., 3000 km. by boat a 
350 k.m.p.h. and finally 15 km, by 
entire distance (4315 k.m. )? 

Sol. The average speed 


Ex. 1-48. A man travels 50 km ai 
K.m.p.h. What is his average speed fc 
Sol. Average speed for the whol 


Ex. 1-49, The price of a commodi: 
to 2000 and 77% from 2000 to 2001. ; 
26% and not 30%. Explain this staten 

Sol. Let the price of the commod 
beginning of 1999 


Since the price from 1999 to 20( 


2000 = 108 105 
100 100 
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given. To find out the average 
he train is to be calculated by 
nce is to be divided by the total 
tmonic mean of average speeds 


1 
eing equal to 7 k.m. So here 


method of averaging. 


—_ =a 


and not 25 k.m.p.h. which is 


of four average speeds 12, 16, 


ze speed of 8 k.m.p.h., another 
is average speed for the entire 


righted harmonic mean of the 


f30k.m.p.h. and returns from 
ze speed of Mr. X for the entire 


60) * 


‘which measure 100 km each. 
de, at 200 k.m.p.h. the second 
rth side. What is the average 


2000 
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400 


ll 


Sol. Average speed 


1 I 1 1 
+ + + 
(45 200 300 as) 
4(1200) 
12+6+44+3 


= Saat = 192 kim. p.h. 


Ex. 1-47. You take a trip which entails travelling 900 k.m. by train at an average speed 
of 60 k.m.p.h., 3000 k.m. by boat at an average speed of 25 k.m.p.h., 400 k.m. by plane at 


350 k.m.p.h. and finally 15 k.m. by taxi at 25 k.m.p.h. What is your average speed Jor the 
entire distance (4315 k.m.) ? 


Sol. The average speed 


900 + 3000 + 400 + 15 


900-— +3000. 1. 400.—! +15. 
60 25 350 25 


4315 
1541204243 
7 5 


fs (4315) (35) 
‘ 525 + 4200 + 40+ 21 


= (4315) (35) = 151025 = 31-56 k.m.p.h. 
4786 4786 


Ex. 1-48. A man travels 50 km ata speed of 20 k.m.p.h. and then returns at speed of 30 
k.m.p.h. What is his average speed for the whole journey ? 
Sol. Average speed for the whole journey 


50 + 50 
50-L + ae 
20 30 


= (20) = 24 k.m.p.h. 
3+2 


Ex. 1-49. The price of a commodity increased by 5% from | 998 to 1999, 8% from 1999 
to 2000 and 77% from 2000 to 2001. The average increase from 1998 to 2001 is quoted as 
26% and not 30%. Explain this statement and verify the arithmetic. 

Sol. Let the price of the commodity in the beginning of 1998 be x. Then price in the 


beginning of 1999 
105 
=) te. 
100 
Since the price from 1999 to 2000 increases by 8%, the price in the beginning of 


108 105 
eer 
100 100 
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Similarly, the price in the beginning of 2001 


se) 
100 /\100 100 
.. The price at the end of 2000 or in the beginning of 2001 
(105) (108) (177) 
“doy 
Let r be the average rate of increase. Then 
(105) (108) (177) ‘ 


100)? =x(l+r) 


1 
1 = — 3/(105) (108) (177 
or +r 00 (105) (108) (177) 


logig(1+r) = 5(-6+2-0212 +2-0334 + 2-2480) 


5 (0-3026) =0-1009 


1l+r = 1-262 


r = 0-262 or 26%. 
a Average price rise was 26%. 
The A.M. of the rise in price is ‘ 
5+8+77 _ 
3 


30%. 


3 
l 
If this be the rise in price in each year, the price at the end of 2000 would be (=) x 


which is much higher than the value obtained from the given datai.e., *| 199 }\ 100 } | 100) ' 
But if the rate of rise in price is taken to be 26%, the price at the end of 2000 would be 


3 
126 
x (=) which is nearly equal to the value obtained from the given data. 


. Average price rise was 26% and not 30%. 

Ex. 1-50. Find the average rate of increase in population which in the first decade had 
increased 20%, in the next 30% and in the third 45%. 

Sol. Let x be the population in the beginning. Then the population at the end of first 


) 120) 
decade = 100}. 


Since the population in the next decade increases by 30%, the population at the end of 


130) (120). 
second decade = 100 | 100} 
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Similarly, the population at 


Let r be the average rate of 
Then 


Ex. 1-51. machine is asst 
second year and 10% per annur, 
on the diminishing value. What 

Sol. Let x be the value of tt 

Then price at the end of fir: 


The price at the end of 2nd 


The price at the end of fifth 


Let r be the average rate of 


(75) (60) (90)° ‘ 
(100)° 


en 


logig (1 


(1 
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See aelakteierichereeeieatse eae 


77). 
oo) 


2001 


i x(l+r)? 
177) 
-0334 + 2-2480) 


)09 


130)", 
end of 2000 would be 100 


| s2 
dataie., * 100 }\ 100 } 100 }" 


ice at the end of 2000 would be 


m the given data. 


tion which in the first decade had 


the population at the end of first 


30%, the population at the end of 
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Similarly, the population at the end of third decade 


145))(130)(120) | 
~ (100 /\ 100) 100)” 
Let r be the average rate of increase per year. 


Then 


(145) (130) (120) 


(100)° x= x(1 +r)? 


logio (en=— {-6 + 2-1614 +2-1139 + 2-0792} 


+ (0.3545) = 0-0118 
30 


l+r = 1-028 


r = 0-028 or 2-8% = 3%. 


Ex. 1-51. A machine is assumed to depreciate 40% in value in the first year, 25% in the 
second year and 10% per annum for the next three years, each percentage being calculated 
on the diminishing value. What is the average percentage depreciation for the five years ? 

Sol. Let x be the value of the machine in the beginning. 

Then price at the end of first year 


(0 
= |/—7/|%. 
100 


The price at the end of 2nd year 


- (ss) ioo)* 


The price at the end of fifth year 


~ Gi) GG) 


Let r be the average rate of depreciation per year. 


3 
" (75) (60) (99) Seen 
(100) 
logio (1=r) = (1041-8751 + 1-782 + 3(1-9542)) 
= g (10-41-8751 +1-7782 + 5-8626) 
= 1-9032 
“ (l-r) = 0-8002 
=> r = 19-98% 


= 20%. 


40 
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Ex. 1-52. The age distribution of the members of a certain children’s club is as follows : 


Age on last birthday (in yrs.) 


Frequency 
5 
9 
18 
35 
42 
32 
15 
yi 
3 


There is a member A s.t. there are twice as many members older than A as there are 
younger than A. Estimate his age (in years upto two places of decimals). 


Sol. Since ages on last birthday are given, 

No. of persons who are in 4—5 group = 5 

No. of persons who are in S—6 group = 9 
and so on. 


Frequency (f) 


th 
Age of A = size of () item 


1 th 
= size of & ;] item 


which lies in 7—8 


8-7 


ll 
~l 
+ 

w 
Bir 
FN 
ie) 
Ww 
wile 
ee 


| 
~ 


ll 
~ 
on 
= 


Cumulative 


FREQUENCY DISTRIBUTION ANI 
— eee 


Ex. 1-53. An incomplete f 
Variate : 10—20 20—30 


Freq. : 12 30 


Given that median value i: 
Sol. Median = 46 lies in 4: 


. 


where x is the freq. of the class 
. x = 33-5 >= 34, 
.. Freq. of class 50—60 = 


Find arithmetic means of foll« 
1. Gold output (in millions o: 
94 95 96 
78 82 83 


2. x:0 1 2 
fil 9 26 
where x denotes the numt 
tossed 256 times. 


3. Age Group No. of pe. 


25—30 1 
30—35 2 
35—40 4 
40—45 10 
45—50 21 


4. Find A.M. of data in Ex. 1-' 


5. Wes (in Ibs.) Freq. 
90—100 10 
100-110 37 
110—120 65 
120—130 80 

6. Wage (in Rs.) No, of E} 

50—55 2: 
45—50 3( 
40—45 4C 


35—40 4s 
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children’s club is as follows : 
“requency 


rs older than A as there are 
‘decimals). 


nulative 


: 
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Ex. 1-53. An incomplete freq. dist. is given below : 
Variate: 10—20 20—30 


Freq. : 12 30 65 ? 25 


30—40 40-50 50-60 60—70 70—80 Total 
? 


18 229 


Given that median value is 46, find the missing frequencies using the median formula. 


Sol. Median = 46 lies in 40—50 class 
229 | 


We (x+ 42) 
a 46 = 4041042 


where x is the freq. of the class 30—40. 
*, X= 33-5 ~ 34, 


.. Freq. of class 50—60 = 229 — (Sum of remaining frequencies) 
= 229 — 184 = 45 


EXERCISES 
Find arithmetic means of following datas : 
1. Gold output (in millions of pounds) for different years. 
94 95 96 93 87 79 73 69 
78 82 83 89 95 103 108 117 


2. x:0 1 2 3 4 5 6: 7 
fii 9 26 59 72 52 29 7 


68 67 
130 97 
(Ans. 90:15) 
8 


I 


where x denotes the number of heads and f their frequencies when eight coins are 


tossed 256 times. 


(Ans. 3-97) 
3. Age Group No. of persons Age Group No. of persons 
25—30 1 50—55 53 
30—35 2 55—60 126 
35—40 4 60—65 163 
40—45 10 65—70 35 
45—50 21 70—75 6 
75—80 I 
(Ans. 58-66) 
4. Find A.M. of data in Ex. 1-26. (Ans. 110) 
5. Ws (in lbs.) Freq. Wes (in Ibs.) Freq. 
90—100 10 130—140 51 
100—110 37 140—150 35 
110—120 65 150—160 18 
120—130 | 80 160—170 4 
(Ans, 125-73) 
6. Wage (in Rs.) No. of Employees Wage (in Rs.) No. of Employees 
50—55 250 “3035. 800 
45—50 300 25—30 - 1100 
40—45 400 20—25 1700 
35—40 450 


(Ans. 31.15) 


42 


7. No. of days absent 


Less than 5 
Less than 10 
Less than 15 
Less than 20 
Less thar 25 


P3 


No. of students 


8. Ei out the median from the following data : 


10. 


11. 


12. 


13. 


14, 


ge group (in years) 


No. of men 


Age group (in years) 


No. of men 


From the table given below, find out the median and the quartiles : 


15—20 
$5 
40—45 
28 


Marks No. of students 
0—S5 4 
5—10 6 
10—15 10 
15—20 10 
Size 11—15 16—20 
Freq. 7 10 
Size  41—45 46—50 
Freq. 11 5 


20—25 
9 
45—50 
6 


Marks 
20—25 
25—30 
30—35 
35—40 


21—25 
13 


MATHEMATICAL STATISTICS 


No. of days absent No. of students 


Less than 30 487 
Less than 35 493 
Less than 40 497 


Less than 45 $00 
(Ans. 13-51) 


25—30 30—35 35—40 


82 58 49 
50—55 
3 
(Ans. 32-07) 


. From the following data find out the median and the quartiles : 


No. of students 
25 
22 
18 
5 
(Ans. 24; 17-5; 29°54) 


26—30 31—35 36—40 
26 35 - 40 


(Ans. 33, 26°8, 37-9) 


Find the Quartiles, 20th percentiles and the 8th decile of hts from the following table : 


ht (in inches) 


Find out the median and quartiles of data in Ex. 7. 


No. of students 


ht (in inches) No. of students 
63 22 
64 20 
65 10 
66 8 
(Ans. 60; 63; 60 and 63) 


(Ans, 12:8; 10-02; 16-4) 


Find out the median, quartiles, 6th decile, 70th percentile and 3rd quartile for the data 


in Ex. 1-6. 


Farm size (acres) 


0—40 
41—80 
81—120 

121—160 


(Ans, 124-75; 114-3; 136-47; 128-5; 133-53 and 21-85) 
The following table gives the dist of farms according to their sizes in a given region. 
Calculate the median and the quartiles (size of the farm is rounded to the nearest acre) : 


No. of farms Farm size (acres) No. of farms 
394 161—200 169 
461 201—240 113 
391 241 and over 148 
334 


(Ans. 95-85; 49-91; 151-82) 
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15. Find mode from the follow. 
Wage (in Rs.) 20 i 
No. of workers 8 ] 


16. Find mode of data in solvec 
17. Find mode of the data in E» 
18. Find mode of data in Ex. 1. 
19, Findmode of data in Ex. 1. 
~ Find mode of data given be 

5 students get less than 3 

12 students get less than 6 

25 students get less than 9 

30 students get less than 12 

21. The consumption of petrol 
planes to a hill station and ‘ 
average would you conside 

per gallon for up and down 


22. Under what conditions wei; 
(i) equal to simple aver: 

(ii) greater than simple z 

(iii) less than simple avei 
Illustrate your answer with 

23. The following is the dist. : 
measure of central tendenc: 


Age-group No. of 
0—9 Z 
10—19 ‘ 
20—29 i 
30—39 


24. The table below shows the 
2002. 


Age No. (in mii 
under 25 2°22 
25—29 4-05 
30—34 5-08 
35—44 10-45 
45—54 9-45 
Do you think that in this ca 


mean ? Give reasons. 

25. The daily expenditure of 1( 
Expenditure : 0—10 
No. of families : 14 
The median and mode for tt 
the missing frequencies. 
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15. Find mode from the following data : 


absent No. of students 2 
Wage (in Rs.) 20 21 22 23 24 25 26 27 28 


n30 487 

n35 493 No. of workers 8 10 11 16 20 25 15 9 6 

n 40 _ 497 (Ans. 25) 
n45 500 16. Find mode of data in solved Ex. 1-4. (Ans. 17-78) 


17. Find mode of the data in Ex. 7. (Ans. 12-48) 


18. Find mode of data in Ex. 1-5. (Ans. 110-91) 


(Ans. 13-51) 


j 19. Findmode of data in Ex. 1-6. (Ans. 123-41) 
30 30—35 35—40 i ~ Find mode of data given below : 
58 49 5 students get less than 3matks > 1; = ~J- Ly re 
55 12 students get less than 6 marks ~~ | A, + A 7 
j 25 students get less than 9 marks 
(Ans. 32:07) j 30 students get less than 12 marks" (Ans. 7:29) 
files: ; 21. The consumption of petrol by a motor was ‘a gallon’ for 20 k.m. while going up from 
lo. of students planes to a hill station and ‘a gallon’ for 24. miles while coming down. What particular 
25 average would you consider appropriate for finding the average consumption in miles 
22 per gallon for up and down journey and why ? 
18 9 
5 [Ans 21— m.p.h. per gallon 
(Ans. 24; 17-5; 29-54) Ml 
quartiles: 22.’ Under what conditions weighted average is 
0 3135 36—40 (i) equal to simple average. 
35 40 (ii) greater than simple average. 


(iii) less than simple average. 
| Illustrate your answer with the help of examples. 


(Ans. 33, 26°8, 37-9) 23. The following is the dist. of 136 individuals by 10-year age groups. Calculate that 


hts from the following table : measure of central tendency which will appropriately describe the dist. 
) No. of students Age-group No. of persons Age-group No. of persons 

oD) r 0—9 48 40—49 13 

20 | 10—19 26 50—S9 4 

10 20—29 27 60—69 3 

8 30—39 11 70 and over 4 


(Median 17-2) 


(Ans. 60; 63; 60 and 63) 24. The table below shows the age dist of heads of families in country A during the year 


2002. 
(Ans. 12-8; 10-02; 16-4) Age No. (in millions) Age No. (in millions) 
2 and 3rd quartile for the data under 25 222 55—64 6°63 
25—29 4-05 65—74 4:16 

47; 128-5; 133-53 and 21-85) 30—34 5-08 75 and over - 1:66 

their sizes in a given region. 35—44 10-45 

rounded to the nearest acre) : 45-54 9-47 
‘acres) No. of farms Do you think that in this case median is a better measure of central tendency than the 
00 169 mean ? Give reasons. 
40. 113 25. The daily expenditure of 100 families is given as under : 
1 over 148 Expenditure : 0—10 10—20 20—30 30—40 40—S0 

No. of families : 14 ? 27 ? 15 


(Ans, 95-85; 49-91; 151-82) The median and mode for the distribution are Rs. 25 and Rs. 29 respectively. Calculate 


the missing frequencies. 
(Ans. 33, 11) 
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Measures of Dispersion and Skewness 


2.1. Introduction 
-In the preceding Chapter several measures used to describe the central tendency of a 
frequency distribution were discussed. These measures have their limitations and may conceal 
much pertinent factual information. It is also possible that these measures of central tendency 
may give results which are quite misleading. Thus, a measure of central tendency alone is 
not enough to give a correct picture of a distribution and for this some additional information 
is required. The following information is needed : “ : 
(1) The extent of scatteredness of items around central tendency. This is called 4 l 
dispersion. re 
(2) The direction of scatteredness. This is called skewness. i ~ It lies in 12-16 
(3) The extent to which the distribution is more peaked or more flat-topped than the 
normal distribution. This is called kurtosis. 4 


2.2. Measures of Dispersion ; 
The object of measuring dispersion is to obtain a single summary figure which i 
adequately exhibits the extent of the scatter of the variable values. Various measures of i 
dispersion are : 
(1) Range, Interquartile range and Quartile deviation 
Range. It is difference between the greatest and least values of the variate. 
Interquartile range. It is the difference between the upper and lower quartiles, ' 


ie, Q;—-Q,. | ; 4 
Q,-Q, 
2 


3 
Quartile Deviation. /t is defined to be , where Q, and Q, are quartiles. Q, has ri = 81-7: 


Quartile Co-efficient of Dispersion. /t is defined to be ”. It lies in 24—28 


Q3-Q) 
Q;+Q, 
Ex. 2-1. Compute Quartile Deviation and the co-efficient of dispersion from the following 
data: 
Size Frequency Size Frequency 
4—8 6 24—28 12 
8—12 10 28—32 10 
12—16 18 32—36 6 
16—20 30 36—40 2 
20—24 15 


and Co-efficient of dispersic 


ad Skewness 


cribe the central tendency of a 
eir limitations and may conceal 
se measures of central tendency 
re of central tendency alone is 
1is some additional information 


itral tendency. This is called 


28S. 
1 or more flat-topped than the 


ingle summary figure which 
: values. Various measures of 


n 
lues of the variate. 
per and lower quartiles, 


Q; and Q, are quartiles. 


f dispersion from the following 


Frequency 
12 
10 
6 
2 
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Sol. 


Size Class 
Interval 


Frequency 


C. Frequency 


109 
Q, has ee 27-25 items below it 


. It lies in 12-16 


Q, 


Il 


124-4 27.25-16) 
18 


4 
12 + — (11-25 
ia | ) 


3 
Q; has Fics, = 81-75 items below it. 


-. It lies in 24—28 


and Co-efficient of dispersion 


Q; 


Q.D. 


24+ 4 gi.75 — 79) 
12 


24 + = (2-75) 


= 24 + 0-92 = 24-92 


O;-Q, _ 24-92 —14-5 
2 


2 
10-42 _ 
2 


wn 


21 


Q,;-O, 24-92-14-5 


~ Q3+Q, 24-92 414-5 


45 
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10-42 


=——— =0-26 
39-42 


1 
(2) Mean Deviation. /t is defined by M.D. = W >: 7 |x-—al where ‘a’ is a point from 
which the deviations are to be taken. 
Co-efficient of Mean-Deviation. /t is defined to be 


Mean- deviation about 'a' 
a 


If nothing is mentioned usually mean deviation about median should be calculated. 
Short Cut Method. M_D. is calculated more easily by the formulae 


2 fle b|+(a- ofr yy) 


M.D. about ‘a’ = aSe et 
N 
and M.D. about median (M) = +12 fe- >, sh 
x>M x<M 
Ex. 2-2. Find the mean deviation for the following data : 
Height No. of Height No. of 
(in cms.) students (in cms.) students 
158 15 159 20 
160 32 161 35 
162 33 163 22 
164 20 165 10 
166 © 8 


Sol. It is not given about which the mean deviation is to be calculated. So mean deviation 
about median is to be calculated. 
Calculation of Mean Deviation 


195+1 


Median = Value of (Ae 


= Value of 98th item 
= 161 


th item 
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Mean Dev: 


Theorem 2.2-1. Show that 
from any other value. 
Sol. Let x be the variable. 


Let x,,X2,.....X, be the vz 
by def., 


Mean deviation ab: 


where ‘a’ is any other point. 


+, M.D. about M = ~> 
Nn eM 


As M is the median, the r 
items for which x > M. 


x 


x<M 


M.D. at 


(i) Leta<M 
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26 


al where ‘a’ is a point from 


lian should be calculated. 
: formulae 


No. of 
students 


OU bBWNH HS OK NW 


em 
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Méan Deviation = 334 
_ Mean eviation = 195 
=171. * 
Theorem 2.2-1. Show that the mean deviation from the median is less than that measured 
from any other value. . 
Sol. Let x be the variable. 
Let x, X3,.--..X, be the values arranged in ascending order. Let M be the median. Then 


by def., 


Mean deviation about M = ») |x; - M | 
i=] 


- = )e- Me x 


nt <M NSM 
= ~Su- N42 DG M) 
Nn cM } n SM 


~ S- at+a- 042 DO- -a+a-M) 


neM n SM 


where ‘a’ is any other point. 


. MD. about M = <u- ~a)t— ~ ya n+ Dor a)+— (a -M) 


nM n cM N ioM n SSM 
--y- a+~D(a- x) +— ~ Ye -a)-—(M- a) 
nM nN cM NSM NSM 


As M is the median, the number of items for which x <M _ is equal to the number of 
items for which x > M. 


> (Mt -a)= DM -a) 


x<M x>M 


M.D. about M= | aXe x)+— =~ a) 


nyoM 


(ij) Let a<M 


Tle n+ Ya- x) Le a= Yee a) 


x<a Nn exeM >a "usx>a 
1 1 2 
= [PXe-o+t Ne -a| vis SG -a) 
i x<a + x>a He a<x<M 


lx 2 
=~ ila —al oe Yi -4) 
i=l 


a<x<M 
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As in second term, x > a, 


(i) By Direct method, 
Mean deviation about medi 


* Yix-a) 


a<x<M 
is non-negative. 
.. Mean deviation about median is less than that measured from any other value ‘a’. 
(ii) Let a> M. (it) By short cut method, 
Mean deviation about medi 


1 1 
Here M.D. about M =— xC —x)+-— > (r-a) 
n n 
x1<M x>M 


| =< »; (a-2)-— Ye-9+2 Yo-a+- Va-@ 


Now >) fr=291 and > 


M<x<a M<x<a x>a a>x>M 
x>M x<M 
n 
at 2 : 15 
= = Ix, solar xC ~ x). “. M.D. about median = — 
né n Li 
i=] M<x«<a 


Since second term is non-negative, mean deviation about median is less than that Ex. 2-4. Show that the mean 


measured from any other value ‘a’. in the form. 
Ex. 2-3. Find the mean deviation about median from the following data : 
SN. Marks SN. Marks S.N. Marks 

1 17 7 41 13 11 

2 32 8 32 14 15 

3 35 9 a 15 35 where f, is the frequency of the 
: : a fe . neh Mean deviation about 1 
6 21 12 22 18 12 


(4) by direct method. 
(ii) by short cut method. 
Sol. Arranging Marks in ascending order : 


Marks (x) 
11 


; Now fi - => fi 
3 i i 
: ©. DAG + VAG, 
6 Xj <X x,>X 
; o § DAGi-D= AG 
9 x;> x;<X 


: 2 x ~ 
aay AE) = 5 
18+1 x, <¥ 


Median = Value of = 9-5th item 


(3) Variance. It is defined t 


_ Value of 9th item + Value of 10th item 
2 
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Porat cil ant dtel ae a 


2d from any other value ‘a’. 


yut median is less than that 


following data : 
SN. Marks 
13 11 
14 15 
15 35 
16 23 
17 38 
18 12 


item 
alue of 10th item ; 
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2 21+22 = 21-5 
2 
(i) By Direct method, 
Mean deviation about median 
-Dl_3,4 
18 


(ii) By short cut method, 
Mean deviation about median 


Now "fr =291 and fr = 140 


x>M x<M 


. 15] 
.. M.D. about median = ore =8-4. 


Ex. 2-4, Show that the mean deviation about the mean x of the variate x can be written 
in the form. 


Ds Zin] 


where f, is the Srequency of the value x;. 
Sol. Mean deviation about mean is given by 


S= WAH 
2 WAG) Dali -) 


Now ye Fs = Nx-Nx=0 


. LAG -D+ YK, -D=0. 


or DAG -X)= DAG-%) 
; $=) fi-x)= aE! Sia 


(3) Variance. It is defined by 


Ha = Be 3) 
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Standard Deviation. /t is the positive square root of the variance. 
Mean Square Deviation. Mean square deviation about the pt ‘a’ is defined by 


1 
b2'(a)=— Uf (x-a)’. 


Root Mean Square Deviation. /t is the positive square root of mean square deviation. 
Co-efficient of Variation. /¢ is defined to be 


d 
Co-efficient of Dispersion. /t is defined by an 


For a given data s.d. is obtained by the formula 


1 1 : 
do = hl—> f.X2 -| —L FX]. 
2 eae (tz0x) 


where X= : i 


Ex. 2-5. Calculate the mean and s.d. of the following values of the world’s annual gold 
output (in millions of pounds) for 20 different years : 
94 95 96 93 87 ~=6©79 73 69 68 67 
78 82 83 89 95 103 108 117 130 97 
Also calculate the percentage of cases lying outside the mean at distances 


to, +20, +30 where o denotes the s.d. 
_ Sol. Arranging the data in ascending order : 
Output (x) Output (x) 


Arranged =x- an Arranged 
in order in order 


ee cc a 


2X = 128-125 =3 


EX? = 2982+2131=5113 
3 
A.M, = 90+} — |=90+0-15 
& 


= 90-15 million pounds 
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Now, me 


. No. of cases outside t! 
.. Percentage of cases 01 


No. of cases outside the: 
mean +20 
.. Percentage of cases 01 


No. of cases outside the : 
ie, 90:15+47-97 or 


.. Percentage of cases 01 


Ex. 2-6. The distribution 
factory is shown below. Com} 
the distribution : 


Max. Loc 
9-3. 

9-8. 

10-3- 
10-8. 
11-3- 
11-8- 
12:3. 
12-8. 
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— EE ee a 
? variance. 


the pt ‘a’ is defined by , (ts 


(ts 


oot of mean square deviation. _ : 
y: a5 (5113)- us 


= _ 1022609 


20 
= | Jionsi 
20 
319-767 
2 sO = — —— =15-99 million pounds. 
fX 20 
Now, mean +o = 90-15+15-99 
= 106-14,74-16 
.. No. of cases outside the range 74-16 to 106-14=7 
tes of the world’s annual gold .. Percentage of cases outside the mean at distances +o 
69 ~—s «68 67 ee x 100 = 35%. 
17 13097 20 
‘de the mean at distances No. of cases outside the range. 


mean +26 i.e., 90-15+31-98 or 58-17 to 122:13 = 1. 
.. Percentage of cases outside the mean at distances +26 


= | 100 =5% 
20 


No. of cases outside the range mean +36 
ié., 90-15+47-97 or 42-18 to 138-:12=0 


.. Percentage of cases outside mean +30 = 0%. 


Ex. 2-6. The distribution of maximum loads in tons supported by cables produced in a 
factory is shown below. Compute the standard deviation and the co-efficient of variation of 
the distribution : 


Max. Load (in tons) ~ No. of cabies 
9-3—9-7 2 
9-8—10:2 5 

10-3—10-7 12 
10-8—11:2 17 
11-3—11-7 . 14 
11-8—12-2 

12-3—12-7 3 
12-8—13-2 1 


60 


52 


9-3 9-7 
9-8—10-2 
10:3—10-7 


10-8—11:2 
11-3—11-7 
11-8—12:2 
12-3—12:7 
12:8—13:2 


logig (S.D.) 


S.D. 


A.M. 


.. Co-efficient of variation 
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= oe 7860 -121 
1 
= — ¥7739 
120 
= ; logig (7739) — logy, (120) 


= 5 (38887) —2-0792 


= 1:94435 — 2-:0792 


~ 1-8652 
= 07331 ~ 0-733 tons 


ll 
= 104053) 


11 1331 


+— 
120 3120 
11-092 tons. 


S. 
Ga M -} 000 


0°733 \ 109) 
11-092), 


MEASURES OF DISPERSION AND SKE 


Ex. 2-7. (a) Find out the co-ef, 
Va 

Mea. 

(0) If in a series which is not h 


approximate value of its s.d. us 


Sol. (a) S.D. = 148-6 =12-1 


Co-efficient of variation 


(5) M.C 
But MLC 
S.D 


Ex. 2-8. Calculate s.d. of data gi 
frequency of less than type) : 

"Marks St 

80—84 

75—79 

70—74 

65—69 

60—64 

55—59 


Sol. 


Mid points 
(x). 


FATNAAAwYwW OH 
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Ex. 2-7. (a) Find out the co-efficient of variation if 
Var = 148-6 
Mean = 40 
(5) If in a series which is not highly skewed the mean deviation 7-8, what would be the 


al 


4 
; approximate value of its s.d. [se M.D. = | sa 
1 
0 Sol. (a) S.D.= 148-6 =12-19 
1 Co-efficient of variation 
2 
3 = Ge (100) 
4 A.M. 
_ 12-19 (100 _ 1219 
Z 40 40 
= 30-5% 
(4) M.D. = 7:8 
4 
But MD. = =(SD.) 
S.D. = 2 (M.D.) = 2 78 
one, 4 i ) 4 ( ) 
39 
(120) = race 


Ex. 2-8. Calculate s.d. of data given below by the method of summation (using cumulative 
Jrequency of less than type) : 


Marks Students Marks | Students 

80—84 1 50—54 6 

75—79 1 45—49 6 

70—74 1 40—44 6 

65—69 4 35—39 3 

60—64 4 30—34 0 

55—59 7 25—29 1 
40 

Sol. 
Mid points Freq. (f) 
~. 


PAIN ANAAWOH 


(Contd.) 
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Standard deviation by the method of summation is given by 


SD = hy2F, -F 


where h = common paren 
F, = The Sum of Cumulative Frequencies (less than type) 
divided by the number of items. 
F, = The sum of Cumulative Frequencies (less than type) of the Cumulative Frequencies 
(less than type) divided by the number of items. 


Here h=5 
_ 270 
1 40 
p, = 1154 
2 40 


gp, = 5,{2308 270 _ (270) 
= 40 40 40 


_ = 92320 — 10800 — 72900 


= 11-61. 
Ex. 2-9. Calculate the s.d. of data given below by the method of summation (using 
more than type cumulative frequency). 
Sol. 


S.D. by the method of summation is given by 


S.D.= hy 2F, —F? -F, 


where / = common class-interval 
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F, = The sum of cumulative F 
items. 

F, = The sum of cumulative Fre 
(more than type) divided by the nur 

Here r 


F 


F. 


4 


S.D 


Ex. 2-10. The following table 
companies A and B. Find out which 
Share A: 318 322 
ShareB: 2542 2542 

Sol. Arranging the values in asx 


For Share A, 


A.M. 


SD. 
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iven by 


erval 

lative Frequencies (less than type) 
mber of items. 

pe) of the Cumulative Frequencies 


40 
)— 72900 


‘he method of summation (using 


Second Cumulation 
(c. Freq. of c. Freq.) 
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a 


F, = The sum of cumulative Frequencies (more than type) divided by the number of 
items. 

F, = The sum of cumulative Frequencies (more than type) of the cumulative frequencies 
(more than type) divided by the number of items. 


Here h=10 
ee 1045 
1 230 
oe 3239 
2" 936 


2(3239) 1045 (1045\* 
S.D, = 10 J - —— - | —= 
230. 230 ~\ 230 


= 5 V(6478)(230) — (1045)(230) — (1045)? 
= xs ¥ 1489940 — 240350 — 1092025 


= ES 157565 = 17-258. 
23 


Ex. 2-10. The following table gives the fluctuations in the prices of shares of two 
companies A and B. Find out which of them shows greater variability ? 

Share A : 318 322 325-312 324 315 308 319 

ShareB: 2542 2542 2534 2532 2545 2530 2566 2550 

Sol. Arranging the values in ascending order : 


308 -10 
- 6 

53 

0 


1 
4 
6 
7 


For Share 4, 


A.M. = 318-— = 318-0-125 


= 317-875 


247 ry 
SD. = 4|——-| -= 
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Ex. 2-12. Show that if the vari 


= 5 Vi076- =< Vi975 


s (44-44) = 5-555 


tional to the binomial co-efficients 


n 
the dist is 3° the mean square dev 


S.D. 
Co-efficient of variation A: ra ) 


Sol. AA 
5- 
= 22 (100) 
317-875 
= 17:5%, 
For Share B, 
A.M, = 2542 + > = 2542-625 
appt oe tey: 
. se 8 8 
S 57656 ~ 25 J /7631 19’ (C 
1 
= ri 36) ~ 10-92 
10-92 
Co-efficient of variation (= 42. =.) (100) 
_ 1092000 = 0-43%. 
2542625 
Since co-efficient of variation for share A is greater than that for share B, share A shows 
greater variability. 
Ex. 2-11. Of a final examination in statistics, the mean marks of a group of 150 students 
were 78 and the s.d. was 8-0. In Economics, however, the mean marks of the group were 73 
and the s.d. was 7:6. In what subject was there greater variability ? 
Sol. Co-efficient of variation for statistics paper 
8-0 800 
= | —— |(100) = — 
( 78 )c : 78 I 
= 10-3%, Ex. 2-13. Find the mean devi 


Co-efficient of variation for Economics paper 


: (7:8) a0 = 
73 73 


a+2nd, and prove that the latter i. 
Sol. A.M. is given by 


= 10-4% 
.. In Economics there was greater variability. 
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625 


Vv 7631 


™~w 


in that for share B, share A shows 


marks of a group of 1 50 students 
nean marks of the group were 73 
rlability ? 
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iS ie 


Ex. 2-12. Show that if the variable takes the values 0, 1, 2,...n with frequencies propor- 


tional to the binomial co-efficients 1, "c, "ey... "c, respectively, then the mean of 


n(n +1) 
4 


n cert ; : _ A 
the dist is >” the mean square deviation about x =0 is and the variance is ve 


Sol. AM 
14+" ¢, +" cyt. 
nfisinp + @—DO=2 
DS 
Le C +" C2 tte, 
ee 
=A. =_— 
Dn 2 
i 3 Oc reas” cy+...4n?” Ch 
H2 (0) = se ae 
ee l . 2n = 1 7 1 ‘ 
= poe oe. =r DBO )+x}." cy 
x=0 x=0 
l< 1x? 
AS aenterd Siete, 
ant x=0 
: ‘ ys n n 
= (2.1" cy +3.2." c3+...+n(n—-1)"c,} += 
2" 5 
= J nn-d4)™ yf na) 42 
2% 2 ro 
_ Math) 
4 
n(nt+1) nn? 2” 
Ho FSS ee eee 


4 4 4 
Ex. 2-13. Find the mean deviation from the mean and the s.d. of the A.P. a, a+d......... 
a+ 2nd, and prove that the latter is greater than the former. 
Sol. A.M. is given by 


ad 


a+(a+d)+..4+(a+2nd) 1+2+....42n 
ng dd 0 
2n+1 


2n(2n+1) _ 


=atnd 
2(2n +1) 
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.. Mean deviation from the mean 


1 - 
= ———[{|-nd|+|-(n-1) d}+....+|d]} + {|d}+....+|nd]}] 
2n+1 
2 
8 ete ee 2 
2n+1 One e 


S.D. = Sake (ar #((n=1)d)? 4.(-dY HE 4nd} n | 
2n+1 ’ S<r<— 
: n- 1 
2 
= 2d "2 a2 ea =, 2n(n +1) (2n+1) Theorem 2.2-2. Show that the 
2n+1 6 (2n +1) measured from the mean. 
Sol. Let ‘a’ be an arbitrary poi 
Jr(n+]) n 
sid JSS ' 1 
3 ba (@=— 2 f 


Consider 


2 2 
, n’(n+1 n(n+1 
(Mean deviation)?—Variance = a poor = ae 


1 
(2n+1) 3 ON 
| i=] 
é Mn+ De Bn(n+1)-(4n? + 4n41)} 
3(2n +1) : 
= Sil 
i=l 
- nee ie +n+1)<0 
3(2n-+1) , ra 


.. Mean deviation < S.D. ' 3 2. 
2 (a)—H2 = (%-a)": 


1 n oo 2 ' 
Ex. 2-14. If r be the range and S= fe dG -»'} be the s.d. of a set of ne H2 (4)2H2 
ne jal 


Vh2'(a) > Jur 


“. The root mean square dev: 
mean. , 


observations X 1, Xq 000066 x, then show that 
n )2 
S<r|— 
. \n-l 


and Xp, = MIN. (2X1, XQ... x,) 


Ex. 2-15. /n a series of measur 


of magnitude x. and so on. If x is 


2 
spe jee Xr) _§2 
um, 


where ¥ =k +8 and k is any consti 


Sol. Let x, = max.(x;, x2...x,) 


Then r = x, — x; 


= X) + X44... 4X Xp tXyt... +X 
Nog n> XK t%e key, 


n n Sol. bh. 


(x; —X)* < (4, -x,)° 


S.D 
a s? = ! YG -9 eee See i 
n-1 al : ~n-l = Pook 
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+|d]} + {|d}+....+|nd]}] 


1d 
ap) 
P+.(—d)?} + (a? +...4n2d?}] 


[an +1) (2n+1) 
6 (2n +1) 


n+l} 


1 
2 
°*} be the s.d. of a set of 
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ae et = it 
<— {Sc Xp)" p> 
n-1 1/4 
i=l 
nr? 


cl (. xX, -X, =r) 


1 
ssr{ t } 
n-l 


Theorem 2.2-2. Show that the root mean square deviation is least when deviations are 
measured from the mean. 
Sol. Let ‘a’ be an arbitrary point. Then 


’ = 1 _ ya 1 . -,c- 2; 
Ho = Dita) wy Dyfi F480) 


where x is the A.M. 


-— AAG; -x)? +(¥-a)? + 2(x; - x) (z-a)} 

i=] 
Bes n : ae S 21 n = “As n ; +2 
Hy ahi x)? +(%-a) Wad +2(X Oy DIG x) 
= Hy +(¥-a)? 


Hy (a)~H =(¥-a)’ 20 


or Hy (a)>p, 


Vito (a) > hp 


“. The root mean square deviation is least when deviations are measured from the 
mean. : 


Ex. 2-15. Jn a series of measurements we obtain m, values of magnitude x,, m, values 


of magnitude x and so on. If x is the mean value of all the measurements, prove that 


a 2 
De jPee XJ" _s2 
=m, 


where x =k +8 and k is any constant. 


Sol. He = wy (k)-(%—k)? =p, (k)-8? 
= 2 
SD. = om, (k —x,) _ 32 


<m 


r 
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Theorem 2.2-3. Show that S.D. is independent of origin but not of scale. 
Sol. The transformation corresponding to change of origin and scale is 


y = 224 
; h 
where ‘a’ corresponds to change of origin and / to change in scale. 
: x=a+Uh 
l< ce 
Then B= Dil) = yD Sila + Uh) 


i-l izl 


l< ee 
= ath) AU; =a+h0 


i=l 
ly = lx v.-vy 
U2 = w dahl F = youl ;7~U) 


= h? w, for U 


My for x = h? ww, for U 
.. Variance and hence s.d. is independent of origin but not of scale. 
Ex. 2-16. From a sample of n observations, the A.M. and variance are calculated. It is 
then found that one of the values x, is in error and should be replaced by x,|. Show that the 
adjustment to the variance to correct this error is 


1 i 2 
pce ke Ar) 
n 


where T is the total of original observations. 


Sol. Let ¥ and o” be the calculated values of A.M. and variance. 


Then Xx; = nx 
=\2 2 

and U(x; -x)" = no 
A 2 = _ 2 
Le., ie —nx2 =o 


2 — 
Ix; = no? +nx? 


Now corrected value of ps i (= ~xX+ x] 


Corrected value of Er,” im (zx, may x{”) 
1 ' 
“. Corrected value of A.M. = = —{2x;-x, +, ) 
n 
_ x, = x; 


x+ 


n 
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a ee 


and corrected value of ee 


~. Corrected value of varianc 


. Adjustment to the varianc 


Now 


.. Adjustment to the varianc 


Ex. 2-17. For a frequency di 
in intervals 0—5, 5—10, .....etc.) 
discovered that the score 43 wa. 
corrected mean and s.d. corresp 

Sol. Since the score 43 wast 
45 and 50—SS5, in the calculation 
of the actual value 42:5. 

Now if x be the variate, 


- .. Corrected value of 


.. Corrected value of mean 


Also 
se Ve 


en 
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‘igin but not of scale. 
origin and scale is 


re in scale. 


zsathU 


ix = 
=W? — §U,-Uy 
NO 


ut not of scale. 
and variance are calculated. It is 
de replaced by x, . Show that the 


") 


ind variance. 
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Sa ter ih NA 


L. 1 : 12 
and corrected value of — Ex," = — {Ex - <i +x, } 


.. Corrected value of variance 


! 
Q 
N 
+ 
oi 
r) 
ae 
a 
| 
+ 


.. Adjustment to the variance to correct the error 


= corrected value of variance — 6? 


2 2 
' oe Ud 
as x —-x —- Xx -x 
oo fda AIL hs gS aL 
n n 


1 ’ ’ _ —< 
= —(xX, —X1) 4X +, ey ee bene 
n n 
I T 
Now x= Uy ET (T= 2x;) 
n n 
.. Adjustment to the variance 
1, ' "x, +2T 
n n 


Ex. 2-17. For a frequency distribution of marks in History of 200 candidates (grouped 
in intervals 0—5, 5—10, .....etc.) the mean and s.d. were found to be 40 and 15. Later it was 
discovered that the score 43 was misread as 53 in obtaining the frequency dist. Find the 
corrected mean and s.d. corresponding to the corrected frequency dist. 

Sol. Since the score 43 was misread as 53 and the scores 43 and 53 lie in intervals 40— 
45 and 50—55, in the calculation of mean and s.d. variate value was taken to be 52:5 instead 
of the actual value 42:5. 

Now if x be the variate, 


Ex = (40)(200) = 8000 


_ .. Corrected value of 


Xx = 8000-52-5+42-5. 
= 7990 
.. Corrected value of mean 
7 7990 = 39.95 
200 
Also s.d. = 15 
Bs Var(x) = 225 


S(x— x)? = (225)(200) = 45000 
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or Sx? — Nz? = 45000 
Ex? = 45000 + (200)(1600) = 365000 
.”. Corrected value of x? 
= 365000 - (52-5)? + (42-5)? = 364050 
. Corrected value of 5(x—x)* = (Corrected value of 5x?)—N (corrected mean) 
= 364050 - 200(39-95)? = 44849-5 
.“. Corrected variance 
_ 44849-5 
200 


“. Corrected s.d. = 224-2475 = 14 97. 


Ex. 2-18. The mean of 5 observations is 4-4 and the variance is 8 24. If three of the five 
observations are 1, 2 and 6, find the other two. 


= 224-2475 


Sol. Let x, and x, be other observations. Then 
(4-4)(5) =(1+2 +6) + (x, +x) 
or X, + xX_ =22-9=13 .(1) 
and = (8-24)(5) = (1-4-4) +(2- 4-4)? +(6-4-4)? +(x, - 4-4)? +(2, -4-4)" 
xy" +x,” =97-0 
Now 2(x,” +x5")=(x, + 3)? (4, =)? 


4 X,;—%X2,=5 (taking positive sign) ..-(2) 
From (1) and (2) 

x, =9, x, =4. 
Ex. 2-19. [f the mean and s.d. of a variate x are m and 6 respectively, obtain the mean 


and s.d. of gene where a, b and c are constants. 
c 
Sol. Let U = ax tb 
c 


Let U and Oy be the mean and s.d. of U. 
~ 1 ax+b\) 1f 1 1 
Then T=" A{ : J-HetDarot st 


axt+b _amt+b 


c c 


. 1 = a’ | 2 a’ 
and Oy” SD Cau) aerate Nice =o" 
c c 
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Theorem 2.2-4. Show that tl 
Sol. Let x be the A.M. Then 
ie., S.D. 2 Mean deviation 


. 1 
ie., J— 
N 
Le. 
where Yt 
ie. (fit fotthy) (fy 4 
1.€., AAO ~y)"4 


which is true. 


Ex. 2-20. Show that if the de 


x 
and higher powers of (=) may 


(iii) H+M =2G. 
(iv) M? -G? =0° 
(v) MH =G’. 


(vi) mean (vx) =/M t — 
Sol. (i) By def. 
1 n 
log G = woud log x; 


Let X; =x,;-—M so that x; 


lo: 
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Theorem 2.2-4. Show that the s.d. is not less than the mean deviation from the mean. 
Sol. Let X be the A.M. Then it is to be proved that s.d.4 mean deviation from the mean. 
i.e., S.D. 2 Mean deviation from the mean 


ila -xy> wy ats —3 


)) = 365000 


(42-5)? = 364050 


° Sx”)—N (corrected mean)? 


5)? = 44849.5 


2 
i.e, NY fii? 2 [Ss 
i=l i=l 


where y= |x; —x| 


15 


ie, fi + fatethy) Ayr te tSaYn) 2h + faa tet SaYn) 


i.e., Fp fo Hyg)? teveeee 20 
which is true. 


‘ance is 8-24. If three of the five 


3 
x 
Ex. 2-20. Show that if the deviations are small compared with the mean so that (=) 


(1) M 


* +(x 4-4)? +(x, - 4.4)? x 


and higher powers of (=} may be neglected, 
a o . 
(i) G=M|1-——= | where ‘G’ is the G.M. and ‘M’ the A.M. and ‘o’ the s.a. 
2M? 
.-.(2) 
o 
(i) H= M|1-—> where H is the H.M. 
M2 


respectively, obtain the mean 
(iii) H+M =2G. 


(iv) M?-G? =0° 
(v) MH =G’. 


ie 
(vi) mean (vx) = sil 3} 
Sol. (i) By def. 
1 n 
log G = youd log x;. 


Let X; =x,;-M so that x,=X,+M 


ix 
logG = woul log (X; +M) 
i= 
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ly xX, 
== slog M+log|1+— 
eA 4 s( + 


i< X, 
= lo M+—) log |1+— 
F Naas e( x) 


i 


3 
; : : X 
Applying expansion of log (i + *) and neglecting (#) and higher powers 


= 


Bogen Pe de 
log G = log M+— ey) BE SRE 
eM A Waa 


| ie se to Pigey 
=logM+—-— ) f.X,- — x, 
M rps 2M? woud 


G hg = 
“. log 7 =7 o [: > AX 7 | 
M i=l 


(Applying the expansion of e 2” and neglecting higher powers) 


l1 If 
ji) By def. —=— ) & 
Gi) By def. = Nas 


o o 
(iii) From (ii) H+M =M|2-—~ |= 2M|1- 
M? 2M? 
2 


G 


- Be 3 
(iv) From (i) G? = M? f . a =M? f = z:) (neglecting higher powers) 
M 
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 M?-G2=02 
(v) From (ii) MH = M? -o° 


(vi) mean 


Ex. 2-21. Show that, if the 


3 
(=) and higher powers may 


where V is the co-efficient of va 
Sol. From last Ex. 


2(M - 


V= co-efficient of vari: 


2.3. Combining number 
M,,My...Mz, SIZES Ny, Nz,----- 
mean and s.d. of the new distril 


and 
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(i) 


i 


x.) 
x) and higher powers 


[: » AX; = | 


2x powers) 


glecting higher powers) 
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-M2-o? 
. M?-G =e? 
(v) From (i?) MH = M? —o? =G? [from (iv)] 


(vi) mean , d=) fils 
- i=l 


Ex, 2-21. Show that, if the deviations are small compared with the mean M so that 
3 
(=} and higher powers may be neglected. 


| Ve {2(M -G) 
M 


where V is the co-efficient of variation. 
Sol. From last Ex. 


2M -G) = — 

M 
: he s.d. oO 
V= co-efficient of variation = =— 
mean M 


g [20M -G) 
~ M 


2.3. Combining number of distributions. Jf k-distributions with respective means 
M,,My...Mg, SIZES Ny, Nyy... ny ANd S.d.8 01, Oy ,..000 6, be combined together, the 
mean and s.d. of the new distribution are given by 


nym +NyM, +...+Ny My 


ny + Ny +....+Ny * 


d oc nor +N 03 +...+nmp 
an ee 
Ny + Ny +...Ny 
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2 2 2 
¥ ny(m—m,)° +n,(m—-m,)*+....n(m—m,) 
ny aD ny +...+Ny 


Ex. 2-22. The standard deviations of two sets containing n, and n, numbers are 06, 
and ©, respectively deviations being measured from their respective means m, and m,. If 


the two sets are grouped together as one set of (n, +n.) members, show that the s.d. o of 
this set measured from its mean is given by 


2 2 
nO; +150 Ayn 
eo ee 1 (m, —m,)? 
ny +n (n, +n) 


Sol. Let %),%25----%,, and Yj» Y2-+--Yn, be the members of two sets. Then by def. 


m U7) 


Let m be the mean of the grouped set. Then 


- 2 nym +n om 
Wes Vx, Ped Cle Vea ALL 8 
ny +n, ny +n 


Now (t= De -m)? nts Lor my? 


ny +Ny 


i=l j=l 


err xe —m, +m, - m)? +Y0, —My +m, - ‘I 


= Te |e —m,)* + 2(m, - mds m+ oem —m)* 


i=] 


S07) -m)? +2(m, - EM Oe m) 


j=l j=l j=l 


1 
= {mo +n, (mn, —m)* +n 05° +(m, —m)°ng} 
ny +n 


2 2 2 2 
_ Moy +292 5 nym — m)" +n (mz —m) 
ny + hy ny, +N, 


2 1< 
==); —m,)* ete. 
eA 
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Now (my, -1 


and (my | 


Ex. 2-23. An analysis of th 
belonging to the same industry, , 


No. of wage earners 
Average monthly wage 
Variance of the dist. of wag. 


(a) Which firm A or B pays 
(b) In which firm A or B is 1 


(c) What are the measures 0; 
wages of all the workers in the f 
Sol. (a) Firm A pays = (52: 
Firm B pays = (47: 

‘. Firm B pays more as mo1 


(b) Co-efficient of variation 


Co-efficient of variation for 


*, Firm B has greater variat 


(c) Average monthly wage a 
the firm A and B taken together, : 
by Firms A and B together. 


Let m and o be the A.M. anc 


Then 


and 
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2 
(m~ my)? +....n,(m—m,)" ny (m, —m,)* 
A YA) 2) Ma) 


2 
Now (m, —m) 
2 
p tig t..+My (ny, +H) 
ling nm, and n, numbers are ron 2 2 
. d (m,—m)? = sa HSL ade ES vac) 
respective means m, and m,. If = ‘ * 


(nm, +nz)* 
sembers, show that the s.d. o of 


122 2 
=> (mm, ~my) 


tn)? 


ers of two sets. Then by def. 


L< 
cial > yj 
Ny a 


. _ nym +N mM, 
Sr “eae 


1 ny +N, 


~My +m, —m)? 


Deyn? 
+ om ~m) } 
i=l 


2 2 


2 
-m) +n (m, —m)* 
et IMI In) 
ny + Ny 


o; eae —m,) “ 


i=] 


2 2 
6? = Si F292, moti =m)? 
ny +n (m +2)" 
Ex. 2-23. An analysis of the monthly wages paid to workers in two firms A and B, 
belonging to the same industry, gives the following results : 


Firm A Firm B 
No. of wage earners ~ 586 648 
Average monthly wage Rs. 52:5 Rs. 47-5 
Variance of the dist. of wages 100 12] 


(a) Which firm A or B pays out the larger amount as monthly wages ? 

(b) In which firm A or B is there greater variability in individual wages ? 

(c) What are the measures of (i) average monthly wage and the variability in individual 
wages of all the workers in the firms A and B taken together ? 

Sol. (a) Firm A pays = (52:5) (586) = Rs. 30765 monthly 

Firm B pays = (47-5) (648) = Rs. 30780 monthly. 
. Firm B pays more as monthly wages. 
(6) Co-efficient of variation for Firm A 


(100) = > = 19-05% 
as : 52-5 


Co-efficient of variation for firm B 


-(2 
47-5 


Jo 00) = 100 os. 16% 


.. Firm B has greater variability in the individual wages. 


(c) Average monthly wage and the variability in individual wages of all the workers in 
the firm A and B taken together, are the A.M. and co-efficient of variation of the wages paid 
by Firms A and B together. 


Let m and o be the A.M. and s.d. of the wages paid i A and B together. 


7 (586)(52 -6) + (648)(47-5) 


bi 586 + 648 


= 49-87 
Bae (586) (100) + (648)(121) 


and 1234 


i: (586)(49-87 — 52-5)" +648(49.87 ~ 47-5)* 
1234 
= 117:26 
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.”. Co-efficient of variation for firms A and B taken together 


a ¥117-26 ~ 100 = 1082-87 
49-87 49-87 
= 21-7%. 
Ex. 2-24. The first of two samples has 100 items with mean 15 and s.d. 3. If the whole 


group has 250 items with mean 15-6 and s.d. V13-44, find the s.d. of the second group. 


Sol. Let m. be the mean of second group 


(100) (15) + (150)m, 
15-6 = 
Then 5 950 
ae (250)(15-6) — 100(15) ae 
150 


Let 6 be the standard deviation of second group. Then 


13.44 — £40009) + (150) 07] + 100(0-6)? +150(0-4)” 
250 
: o=4. 

Ex. 2-25. The mean and s.d. of 63 children on an arithmetic test are respectively 27-6 
and 7:1. To them are added a new group of 26 who have had less training and whose mean 
is 19:2 and s.d. 6-2. How will the values of the combined group differ from those of the 
original 63 children as to the following (i) the mean (ii) the s.d. 

Sol. Mean m and s.d. 6 of the combined group are given by 


_ 63)(27-6) + (26)(19-2) 
@) = 63 + 26 


. The A.M. is decreased by 27:6-25-1=2-5 


= 25-1 


Go? (63)(7 +1)? +(26)(6-2)° ‘ 63(25-1-27-6)? +26(25-1-19+2)° 
L = ———_—_—_!_ > Seen eee EEE 
63 + 26 63 + 26 
J 
*, © = 7-8 (approx) 
*, The s.d. is increased by 7-8—7-1= 0-7 (approx. ) 

Ex, 2-26. A distribution consists of three components with frequencies of 200, 250 and 
300 having means 25, 10 and 15 and s.d. of 3, 4 and 5 respectively. Show that the mean of 
the combined distribution is 16 and s.d. 7-2 approximately. 

Sol. Let m’and o be the mean and s.d. of the combined distribution. 
_ (25200) + (10)(250) + 5)(300) 


Hien 200 + 250 + 300 


5000 + 2500 + 4500 12000 _ 
750 750 


d 52 = (200)(3*) + (250)(4") + (300)(5") 
- ? 200 + 250 + 300 


_, 200{16- 25}? +250{16 —10}7 + 300{16 — 15}? 
200 + 250 + 300 
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ica tas lea Oe a 


i 


2.4. Moments. The rth momen 


Mt, (a) 


If ‘a’ is A.M, rth moment abou 
Factorial Moments. Factoriai 


Hor) 


(vr 
Absolute Moments. Absolute i 


where Xx 


to be 


Pearsons B and y Co-efficient. 


Moments about mean in terms 


Mp = By") By 
Shappard’s Corrections to Mo 


No correction applied to odd « 


3 (corrected) = pp; and pp | 


lio 
4 (corrected) = H4 ae Wy 


2.4-1. Moments about mean a 


The transformation correspon 


where x and y are A.Ms. 


MATHEMATICAL STATISTICS 


ogether 


1082-87 
49-87 


:mean 15 and s.d. 3. If the whole 
nd the s.d. of the second group. 


57]+ 100(0-6)* +150(0-4)? 
250 


thmetic test are respectively 27-6 
‘ad less training and whose mean 
‘d group differ from those of the 
he s.d. 
ven by 


(0?) 225.1 


2 +26(25-1-19-2)? 
3+26 


with frequencies of 200, 250 and 
ipectively. Show that the mean of 
y. 
d distribution. 
150) + (15)(300) 
) + 300 


00 _ 12000 _ 
750 


(47) + (300)(5) 
0+ 300 


250{16 — 10}? + 300{16 — 15}? 
0 + 250 + 300 
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Re 


_ (1800 + 4000 + 7500) + (16200 + 9000 + 300) 
750 


_ 38800 
750 


O= ¥51-73 =7-2 (approx.) 


2.4. Moments. The rth moment about the point ‘a’ is defined by 


=51-73 


1 r 
uu, (a) = rT Uf(x—a) 


If ‘a’ is A.M., rth moment about ‘a’ is denoted by w,. 
Factorial Moments. Factorial moment of order ‘r’ about the origin is defined by 
ope 


; 1 
Hor) = N 


where x? = x(x-l).....(x-rt+) 
Absolute Moments. Absolute moment of order r about an arbitrary point ‘a’ is defined 


to be 


= Zf|x —al’ 


Pearson's B and y Co-efficients. These co-efficients are defined by 


11 = VB, and y= $,-3 


Moments about mean in terms of moments about any other point are given by 


r , ' r ’ 2 f ' 
Mp = eh" wy BE + C2 Hy-2{Hif t+(-D" ¢, {ui} 
Shappard’s Corrections to Moments of Grouped Frequency Distribution. 


No correction applied to odd order moment 7.e., 1, (corrected) = My 


h2 
3 (corrected) = 3 and p., (corrected) = H2 ra 


1 2 7 4 
d)=p,-—h +—h’. 
M4 (corrected) = H4 , i) 740 t 


2.4-1. Moments about mean are independent of origin but not of scale. 


x-a 
The transformation corresponding to change in origin and scale is 4 = h 


x=atuh 
x =atuh 


where x and 4 are A.Ms. 
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1 : sr 1 . =\r r 
= iit, of may Dhar D =H DSi =h' -(u, of u) 


2.4-2. Expression of rth moment about mean in terms of various moments about an: 


arbitrary pt. ‘a’. 
Let x be the A.M. Then 


IG, «1% pee 
L, = 2h -® = 9g Lille a) + (4-3) 


- WA ~ a)" +e(x; ~a)'"(a-¥) 


+'C,(x; - id (od ae "c,(a-Xx)"] 


=p(a) + "c, wh_(aa-X) + "cp wh_y(a— x)? +...4"C, (a- x)" 
1 n 
Now a-x =-— (x; -a)=—- ‘(a 
N 2: ra )=—H (@) 


we Hy = Bia) ~ "e; wh y(a) pi (@) + "ey wh _o(@) {ui@}° +..4(-D" 'e, (Hi @y’ 


= oe urey Hi 
j=0 
where Lu, denotes ut, (a). 


2.4-3. Expression of rth moment about a pt ‘a’ in terms of various moments about 
mean. 


Let ) (a) be written as wy. 


gs . , 
Then bl = Hout - a) : 
i=] 


WAM -D+E-aY 


1 Se ek 
WL AAG B+ Hh } 


=H, +r Mpa bi tls, one ed eee +c, {uy} 


r 


Dé H,-j {u, ). 


j=0 
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2.4-4. Expression of rth moment 
other pt. ‘b’. 


r note 5 — ri 
He @ == fii - a = 


i=l 


= .(b)+"¢(b-a) pt ( 

Ex. 2-27. Calculate the first fou 
calculate B, and B5. 

x values in cm, are the mid-poin 


x: 2-0 2:5 c 
ps 5 38 
Sol. 


Variable (x) 


By (3-5) 


H3 (3-5) 
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)Y =h’ -(u, of wv) 


ms of various moments about an: 


= D files -@) + (a2 


+ "c,(x; -a)"(a-X) 


~ x)? +..4%c,(a-X)"] 


0)? 4.4 Ca ky" 


Pray" 'o, (u(@y 


rms of various moments about 


Pete, (uy 


By 2{My{}?.. te, fury 
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2.4-4. Expression of rth moment about a pt. ‘a’ in terms of various moments about any 
other pt. ‘b’. 


Fee ot oe ere eT 
HE = La Filai a) y Qafil-)+O-a)} 


1 silts ~ by" +"c, (x; —b)"'(b-a) 


i=] 
+", (x, ~b)?(b-a)’ +...4"c,(b-a)"] 


=p).(b)+"c,(b—a) pl_\(b) +" c)(b—a)* w_4(b)+....+"c, (b= a)’. 
Ex. 2-27. Calculate the first four moments about the mean of the following dist, also 
calculate B, and B,. 


x values in cm, are the mid-points of intervals : 


x: 2-0 2°5 3-0 3°5 4-0 4:5 5-0 
f: 5 38 65 92 70 40 0 
Sol. 


-3 
2 
-1 
0 
1 
2 
3 


Xd- 
is, (G5). = es (0- 5)=- 5 x0: 5) 
=~ =-0-01 
310 
Xd’ - 492 
Hs 3-5) = 0-5)? = 5 (0-25) 
= 13 _ 9.397. 
310 
' : yi 
133-5) = “Lo.5) 
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pty’ (3-5) 


Moments about the A.M. are : 
Hy 


Ho 


H3 
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_ 114 (9.125) = - 0-046 
310 


1788 


id’. f 4 
—— + x (0.5)* = —— (0-0625 
we OO = a0 ) 


1788 1 447 

—— x — =——- = 0:3 

310 16 1240 

0 

5 3-5) —[pj (3-5) 

0-397 —0-0001 = 0-3969 ~ 0-40 


13 (3-5)— 35 (3-5) pt (3-5) + 2[H}(3-S)P 
—0-046 — 3(0:397) (0-01) + 2(-0-01)° 
-0-034 = —0-03 (approx. ) 


[4 =Ha (3-5)— 4p (3-5) ny (3-5) + On} (3-5)Lq4 (3-S)P — 3[uj (3-5)? 
= 0-360 — 4(-0-046) (0-01) + 6(0-397) (-0-01)? —3(-0-01)* 


= 0-358 = 0-36 (approx. ). 


Bi = 


B. = 


Ex. 2-28. The first four moments of a distribution about the value 4 are —1'5, 17, -30, 
108. Calculate the moments about the mean. 
Sol. Moment about the mean are 


H3 


u,> 0-397)’ 


2 (0-397)? 


0 
by) — py? =17-(-1-5)? 
17 —2-25 = 14-75 
Hy —3Hy by +2p"° 
~30 —3(17)(-1-5) + 2(-1-5)° 
-30+76-5-6-75 
39-75 

? ? , t 2 4 
14 — 45 Wy + On By ~ Shy 
108 — 4(-30)(—-1-5) + 6(17) (-1-5)” —3(-1-5)* 
108-180 + 229-5-15-1875 
142-3125 = 142-3 
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Ex. 2-29. The first four moments | 


‘2, 20, 40 and 50. Obtain as far as poss 


the basis of the information given. 
Sol. The first four moments abou 


Hy = 
Ha = 


H3 = 


A.M. = by (0) : 


11 = VB: 


¥2 = B2-- 

Ex. 2-30. The first three moment: 

1, 16, -40. Find as far as you can, the 
information given. 


Sol. A.M. 
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= —0-046 


ss — 1788 (9.0625) 
310 


ee 0-360. 
‘40 


3-5)/° 

1=0-3969 ~ 0-40 

(3-5)p} (3-5) + 2[Hi(3-5)P 
397) (-0-01) + 2(-0-01)° 

13 (approx. ) 

4(3-5)P -—3[p; (3-5)14 

-01)* - 3(-0-01)4 


ad ia 
I 
= 
oO 
Le) 


y=2-27 


tbout the value 4 are —1-5, 17, -30, 


2"° 


5) + 2-1-5)? 
75 


Ho by? -3y,'4 
5)+6(17)(-1 5)? -3(-1 sy 
‘5-15-1875 

+3 
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Ex. 2-29. The first four moments of a distribution about the value 5 of the variable are 


the basis of the information given. 


Sol. The first four moments about the mean are : 


H; =0 


Ho 


H3 


H4 


A.M. = Wy (0) 


vi = vB 


> —p,” =20-4=16 


bs) —3pt' By + 2p, 

40 — 3(20)(2) + 2(2) 

40-120 +16 

64 . 

4, — 4405 py + 6) wy? - 3p," 

50 - 4(40)(2) + 6(20) (2)? —3(2)4 
50 — 320 + 480 - 48 

530 — 368 = 162 


Bh 
Eels 5) 45] 
wits -5)+5 


py, (5)+5=24+5=7. 


162 81 
(16)? 128 


1 


¥2 = B, -3=0-63 -3 =-2-37. 
Ex. 2-30. The first three moments of a distribution about the value 2 of the variable are 
1, 16, -40. Find as far as you can, the various characteristics of this dist on the basis of the 


information given. 


Sol. A.M. 


1 | 
oe Pee eral 


ay uilx-2)+2 


‘2, 20, 40 and 50. Obtain as far as possible the various characteristics of this distribution on 
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= py (2)+2 
=14+2=3. 
The first three moments about the A.M. are given by 
Hy =0 


M2 = Hy’ =H” =loetel5 


3 = ed = 3p,’ By + 2p" 
= -40 — 3(16)(1) + 2(1)3 
= -40- 48+2=-86 


(-86)? 
(15)° 


v1 = Jp, = 2-19 =1-48. 


Ex. 2-31. The first four moments of a distribution are 1, 4, 10 and 46 respectively. 
Compute the first four central moments and beta constants. Comment upon the nature of 
the dist. 

Sol. The first four;central moments are given by : 


Hy =0 


=2-19 


a 
B= + 
HH 


! 12 
Hy =H, HW, =4-1=3 


M3 = Hs —3h2 By + 2p" 
= 10-3(4)(1) + 201) 
= 10-12+2=0 


wg = Ha —4b5 Wy + On) wy? 3p)" 
= 46 — 4(10)(1) + 6(4)(1)? - 3(1)4 


= 46-40+ 24-3 
=27 
27 
f= 0; es ere 
H2 , 


Since B, = 0,B, = 3, the distribution must be normal. 
_ Ex. 2-32. For a distribution of 250 heights, calculations showed that the mean, standard 
deviation, B, and B, were 54 inches, 3 inches, 0 and 3 inches respectively. It was, however, 
discovered on checking that the two items 64 and 50 in the original data were wrongly 


written in place of correct values 62 and 52 inches respectively. Calculate the correct 


frequency constants. 
Sol. Let x be the variable and N be the total frequency. Then N = 250. 
Then dx = (250)(54) 
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eg Sey 


Corrected value of Sy 


Corrected A.M 
Variance 
I <\2 
or oy L(x -— x) 
or D(x - x)* = (250)(9) 


. Corrected (x-x)* = 225€ 


Corrected varianc 


Now | B, =¢ 
H. 
X(x- Xx) 


.. Corrected X(x - xy 
= 0-(64-54) 
= 0-1000 + 6: 
=— 432 


—432 


: ss] 
.. Corrected 3 750 


(corrected 
.. Corrected 8} = —————_ 
(corrected 


— (¢1-728)" 
~ (8-808) 
= 0-004 


L 
L(x - x 
”. Corrected S(x-x)* = 607 


Corrected } 
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48, 


are I, 4, 10 and 46 respectively, 
ints. Comment upon the nature of 


3 is = 3p," 
4)(1) — 3(1)4 


s showed that the mean, standard 
hes respectively. It was, however, 


the original data were wrongly 
vectively. Calculate the correct 


. Then N = 250. 
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Corrected value of yy = (250)(54) — 64-50 + 62 + 52 = (250) (54) 
Corrected A.M. = 54. 
Variance = 9. 


1 a) 
—X(x-x)° =9 
or 7 ( ) 


or X(x—-X)* = (250)(9) = 2250 


.. Corrected 3(x—x)? = 2250-(64—54)* - (50-54)? +(62—54)? +(52-54)° 
= 2250- 100-16 +64+4 


= 2202 
2202 
Corrected variance = 50° = 8-808 
u 2 
Now 6, =0 = a 
Ho 
3 =0 
X(x- x)? =6 


.. Corrected X(x-x)° 
= 0-(64-54)° ~ (50-54)? + (62-54)? +(52 -54)° 
= 0-1000 + 64+512-8 
=~ 432 
—432 
. == =- 1-728 
“. Corrected 13 750 


corrected 9 
.. Corrected B, So 
(corrected [14 ) 


(1-728)? 
(8-808)° - 
= 0-004 
2. 3tef Ase ee 
ly = 243 


¥(x—x)4 = (243)(250) = 60750 


.. Corrected 5(x— x)* = 60750 - (64 - 54)* -(50-54)* +(62—54)* + (52 -54)* 
= 60750 — 10000 — 256 + 4096 +16 
54606 


34606 _ 418-424 
250 


Corrected p14 
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corrected, 218-424 


C ted a Pe io 
eHeriea es (corrected ly)” (8-808) 


_ 218-4 
(8-808)? 
= 2-815. 
Ex. 2-33. Second, third and fourth central moments of a variable characteristics are 


19:67, 29:26 and 866:0 respectively. Calculate the beta constants correct to three decimal 
places. 


(approx. ) 


2 2 
M3 (29-26) 
: p} =—> = ——~ =0-113 (approx. ) 
Sol 1 ua? (19-67) ( 


by 866-0 
By Se 
Hy (19-67)° 


2.4-5. For a symmetrical distribution all moments about the mean of odd order are 


=2-+238 


zero. 
For a symmetrical distribution the frequencies are symmetrically distributed about the 
mean i.e., the values equidistant from the mean have equal frequencies. Let x be the variable 


and x its A.M. 

Let y=x-x. 

Let x,, x. be the values of x equidistant from x. 

Then the quantities (x, — x) and (x, ~—X) are equal in magnitude but opposite in signs. 
Let these quantities by y, and —y,. Then since the distribution is symmetrical the values 
—y, and y, ofy have same frequencies f, each. Let other values of Y be —y2; y.,—y3, 3 


and so on. Let f,, fy... be the frequencies for —y; y.,— 33 Yzeeeeee Let N be the total 


frequency. 
Now by def., 


Moret 


l o\2r+l 
= —Uf(x-x 
woe 


! 2r+l 
=—y 
yw 


1 . 
= ULOl: +1 ay BOs =i be ee =0. 


2.4-6. Show that for a discrete dist B, > 1. 


By def B, = 4 


Ho 
A 6e bit tyou,? 


I< “. ; 1 <t ae “ = 
Le, — (x; - >4— (x; —X)" here xX = A.M. 
a > ACH?) (i D I; -x) where 
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Ie ct 


n n A 
ie, NOAM {S. \ 
i=l i=] 


ie, (fit frtothr Ayr +4 
Le, FAY + Paya teeeect, 
Peere MC tp as 


; 2 
ie, fifx( —Y2) +--+? 0 
which is true as each term on the Ie 


Theorem 2.4-7. Show that for 


Proof. 
By def, 


” By >By if 


1.e., 
Le, 1S fu, -3 , eZ 
N = t { N 


ie, Sa. [S| > 
isl i=l 


4 
i.e., (fi + faq tees +hn 


2 2 
iLeé., fin’ (fin + foyg ted 


, 2.2 2 
1€., fifoyy Yo V1 ~Y2) te 
which is true as each term on left 


*. By >Bh. 
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_ 218-424 
(8-808)? 


of a variable characteristics are 
onstants correct to three decimal 


} (approx. ) 


tbout the mean of odd order are 


nmetrically distributed about the 
frequencies. Let x be the variable 


magnitude but opposite in signs. 
oution is symmetrical the values 
values of Y be —y2; y9,-y3, y3 
Vai Varceeeeeens Let N be the total 


Sh. 
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2 


ie., NY fi fs {Six where y, =(x,; -x) 


i.é., fifo 4 five + oeeecons + fn + fof +fyyo¥ Meld fy") 


BD 
bet FAI + foye tet baYn > AI tat ALi n tA AN ot 


ie, fi fol, Yo) tee > 0 


which is true as each term on the left is positive. 


Ee B, >1. 
Theorem 2.4-7. Show that for a discrete dist. 
By > By 
Proof. 
By def, 
2 
p= —4+ .B, = — 
ba” Ho 
Bs > By if 
2 
a4 _ Bs 
Ho” He 
: 2 
Le, Mal, > H3 


2 
. 1 “ -—\4 1 - =\2 1 . =3 
an te ji WW i%j— ar i(%; — x) 
i.e (eS fe ath ES se x) js [5 ne | 


2 
ie, [Ss04)(S8)-[E0") where y,; =x; -—* 
i=l i=l i=l 


ie, (fi + foyn' + Sask +frdn } (fin + Seasats +frYn) 


2 4 2 2 2 
Le., Ais’ (fin? + fava tt Sdn )+ far (fi + foya ttSaVn le 


2. 6 4)- 53 
> fy + fy yoo tt ty Yn +2fi fay Yo to 


. 2,2 2 
Le, fi fay Yo (Oy —Yo)o tee >0 
which is true as each term on left is non-negative. 


“. Bo > By. 
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2.4-8, Expression of first four factorial moments in terms of ordinary moments about 
origin and conversely. 
Factorial moment of order ‘r’ about the origin is defined by 


! ) 
Hiry = won : 
i= 


where yi) = X(x-1)....Qe-r +1) 


4 (0) om 


Now x4 = x(x—D(x-2)(x 


14 (0) a 


1 n 1 n 3 
Now Bay = ot 3 yaaa = 11(0) 2.4.9 Absolute moment of 
i=l i=l 


lx » 1X 
Hoa) = yD Aiai = fixi(xi -D 
i=l i=l 


and the absolute moment of ord 


I~. 2 1X 

= fin - SY fins = BO -H}O) 
ist i=l 

Evidently mean deviation i 


( > . 2-34. D absolute : 
Ho) = Pale: —1)(%; -2) Ex aie 
i=l 


ly 3 2 ' ' ' where A, is the rth absolute m 
= Dahil — 3x," +2x;} =p; (0)— 3p, (0)+2p, (0) 
isl , 


Let a and b be any two nui 
DA ( 
i=l 

ie, > fila 

_ isl 


Mia) = hin — I(x; -2)(%; - 3) 


i=l 


& 14 (O)~6p13' (0) + 11H’ 0) ~ 6p’ (0) 


Conversely : pp (0) = Hay 


; Limp a dw 
Hea) 0) = ay DSR = Hy DFP D4) 


i=| 


1 n 

2 

i.e, a—) fily, 
yd 


Put yp =X; 


= Hay +Hay 


' ‘1x 
3 (0) = yun 
i=l 


1 A 
Then a? sis 
Let x? = x(x-1)(x—-2)+ Ax(x-1) + Bx jel 
Put . x =1,2 
. B=1and2A+2B=8 or A=3 


wee x(x —1)(x-2)+3x(x-l) +x 


i... a7A,_) +b7A,41 + 2¢ 


2 
A 
Le, Aaa 7 | + 
A, 


: iat 
“(y= Fy Da fils ~1)(x; —2)+3x,(x; -1+x;} 
i=l 
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rms of ordinary moments about ' ' ' 
= HG) + 3H) + Bay 
ed by 


’ 1 . 4 
ba (0) = woah 
Now x? = x(x —1)(x —2)(x — 3) + 6x(x -1)(x-2) + Tx(x-1) +x 


. “ 4 (0) = Hy + 6H) +7H a) +a) 
= 1)(0) 2.4.9 Absolute moment of order r about the origin is defined by 


— > filxil’ 
yea 


(x; -1) 
and the absolute moment of order r about any arbitrary pt ‘a’ is defined by 


' ; ly 
- 15(0) -14(0) — Sfx; -al’ 
y afl 
Evidently mean deviation is the first order absolute moment. 
Ex. 2-34. Define absolute moments. Show that 
AP eA aA oe 
= 13'(0)- 315' (0) + 2n,'(0) where A, is the rth absolute moment about point. Deduce that 


1 rely 2ecise 


es 
A,r = Arf 


x; ~3) Let a and b be any two numbers. Then 


“ rat? 
YH 2 +dly,| 2 | 20 
i=l 


0) - 6, (0) 
ié., > fila? + by" +2ably;|"} 20 
x; ~1)+x;} j=l 
f. me 1 a r 
i.e’, a” —> fly +b° = abl +2ab—-Y Filvl >0 
N i=l N i=l i=] 
Put y, =x; -§. 
21 . ce | 1 - r 1 ‘ r 
Bx Then Pe -€| +b wale -é| . ab Db -€| 20 
i= i= is 


ie, aA,_,+b7A,,) +2abA, 20 


x 4 2 AZ 
; 1.@., Af i : | aang : |e >0 (if A, #9) 


r-l 


+ 3x;(x; -1)+x;} 
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A,’ 
» Arg 720 oF Api Ay 2 A, 


r-l 


Ao s Ary Arey > 
Put r=1,2,3...r 


Ay’ $ ApAp 


n 
Multiplying and using Ap = >, filx; = e° =1 
i=l 


AM Al for r=1,2..... 


to 
r r+] 
Ay s AY 


Ex. 2-35. Show that if the class interval of a grouped dist is less than one-third of the 


calculated s.d. Sheppard's adjustment makes a difference of less than 5% in the estimate 
of s.d. 
Sol. From Sheppard’s correction 


2 
H, (corrected) = pt, -—— 


12 
Let M5 (corrected) = 6,” and p, =0* 
it 
2 \2 
Then oO, = Bek 
12 
1 
2 2 2 
= o{l1-——~ |] =o|1-——+t...... 
120? 40? 
h2 
o-o, =>— 
240 
; 1 
Now h<-7o 
3 
o 
o-0; 
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2.5. Skewness 

For a symmetrical distributic 
frequencies is same on both sides 

A distribution which is not sy: 
lack in symmetry. It may be positi 


and median lies in between the tw 
tail on right) and in negatively ske 
Skewness is measured by eitl 
‘Skewn 

Skewn: 

Skewn 


For moderate skewed distrib: 


mean : 


Co-efficient of Skewness 
Bowley’s co-efficient of Skev 


Karl Pearson’ co-efficient ¢ 


2nd formula is used when m 


In terms of 8, and B., 


Coeff. of skew1 


This is also zero for symme! 
Ex. 2-36. Compute Q.D. an 
Size 
4—8 
8—12 
12—16 
16—20 
20-——24 
Sol. We have 
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G20). Ne de: 
o 200 2 
2.5. Skewness 
For a symmetrical distribution mean, mode and median coincide. The spread of the 
frequencies is same on both sides of the mean. . 
A distribution which is not symmetrical is called skewed distribution. Skewness means 
lack in symmetry. It may be positive or negative. In positively skewed distribution. 


mode < mean 


and median lies in between the two. Curve is more elongated to the right (7.e., has a longer 
tail on right) and in negatively skewed distribution reverse is the case. 
Skewness is measured by either of the formulae : 
Skewness = mean — mode 
Skewness = 3 (mean — median) 
Skewness = Q, +Q, —2 (median) 
For moderate skewed distributions 


1 
mean — median = 3 (mean — mode) 


Co-efficient of Skewness 
Bowley ’s co-efficient of Skewness 


Q; +Q, -2 Median 


ist is less than one-third of the 23 -Q, 
1 Karl Pearson’: co-efficient of Skewness 
less than —% in the estimate a ’ 
e a Maun Mou 
S.D. 
_ 3(Mean — Median) 
S.D. 


2nd formula is used when mode is ill-defined. 


In terms of B, and B,, 


Coeff. of skewness = 2(5B, - 6B; -9) 


This is also zero for symmetrical distribution as B, = 0. 
Ex. 2-36. Compute Q.D. and co-efficient of skewness from the following data. 


a h? 4 Size Freq. Size Freq. 
2407 4—8 6 24—28 12 
8—12 10 28—32 10 
12—-16 18 32—36 6 
16—20 30 ' 36—40 2 

20—24 15 

Sol. We have 
Q, = 14-5 
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Median has S = 54-5 items below it. 
.. It lies in 16 — 20 
= . 4 
Median = 16+—(54-5— 34) 
30 
= 16+ x (20-5) 
30 
= ibe 16+2-73 
30 
= 18-73 
Bowley’s coefficient of skewness 
. _ Q3+Q,-2M 
Q; = 2d, 
_ 24-92+14-5-37-46 
24-92-14-5 
_ B942=37-46.. 1:96: 
10-42 ~ 10-42 2 
a 
= 0-19. Ss 
Ex. 2-37. From the data given below calculate Karl Pearson & co-efficient of skewness : 2 
Marks less than 10 No. of students 5 = 
Marks less than 20 No. of students 12 & 
Marks less than 30 No. of students 32 S 
Marks less than 40 No. of students 44 g 
Marks less than 50 No. of students 50 3 
S 
Sol. & 
Calculation of S.D. and A.M. 
es = 
Class intervals Freq. () Mid-points 
0—10 


Given 
Frequency c. Freq. 


7 
= 25+10) — |=26-4 
ie an 


2 
S.D. = 10 =-(2 == 310 =11-14. 


For calculation of mode see next page. 


Determination of Model class. 


Class 
intervals 
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61:97 = OI x 
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+ = spoyy 
L-02@ a 


O€ — OZ SI SSEID TePoY| *- 


aque, sIsAjeuy 


mn 8 co-efficient of skewness : 


dents 5 
tdents 12 
dents 32 
dents 44 
dents 50 


6 
42 
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Mean — Mode 
S.D. 
_ 26-4—26-19 
LAS 
Ex. 2-38. (a) Find co-efficient of variation if S.D.=3-5, N=10, ux = 145. 
(b) Find the co-efficient of skewness if 


Co-efficient of skewness = 


=0-02 


Difference of the two quartiles = 8 
Sum of the two quartiles , = 22 
Median =10:5 


(c) Fora series the value of M.D. is 15. Find the most likely value of its Q.D. 
Sol. (a) Let x be the A.M. 


x 145 


Then 31. Mo x 
en === 
.. Co-efficient of variation 
S.D. 
= | —— |(100 
sa} 
3-5 
= | —— |(100 
(=) ) 
= aS =24-14% 
145 
(b) Here 0;-Q, =8 
Q; +Q, = 22 
Median = 10-5 
+Q, —2 (Medi 
Co-efficient of skewness = Q3 + Q, ~2 (Median) 
Q; -Q, 
7 22-21 = 1 seine 
8 8 
(c) Let o be the S.D. 
Then M.D. = <9 
2 
and Q.D. = =95 
3 
M.D. ole 26 
QD. 10 5 
5 
QD. = g™D.) 
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Most likely value of Q.D. : 
Ex. 2-39. Compute quartile de 
following values : 


Median = 18-8'',Q, =14-6"',Q 


Sol. Q.D. 


Co-efficient of skewness 


Ex. 2-40. (a) Karl Pearson's co- 
6-5 and mean is 29-6. Find the mode 

(b) If the mode of the above dist 

Sol. (a) We have 

Karl Pearson’s co-efficient of sk 


0:32 
Mode 


Also 
Karl Pearson’s co-efficient of sl 


“. Median is given by 


0-32 


Mediar 


(b) Mean = 29-6, Mode = 24°8 
Karl Pearson’s co-efficient of s 


“. S.D. is given by 


0-32 
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Most likely value of Q.D. = 12:5. 
Ex. 2-39. Compute quartile deviation and the co-efficient of skewness, given the 
following values : ; 


“02 Median = 18-8", Q, =14-6"',Q, =25-2" 
Q;-Q, _25:2-14-6 _ 
2 2 


5, N=10, Lx =145. Sol. Q.D. = 5:3 


Co-efficient of skewness 


_ Q3+Q,-2 (Median) + 


st likely value of its Q.D. lle 
= 39-8-37-6 = 2:2 =0-2, 
10-6 10-6 
Ex. 2-40. (a) Karl Pearsons co-efficient of skewness of a distribution is 0-32. Its s.d. is 
65 and mean is 29-6, Find the mode and median of the distribution. 
(b) If the mode of the above distribution is 24-8, what will be the standard deviation ? 
Sol. (a) We have 
Karl Pearson’s co-efficient of skewness 
_ Mean -Mode 
S.D. 
. _ 29:6 - Mode 
0-32 at aa 
s Mode = 29-6—(0-32)(6-5) = 27-52 
Also 
Karl Pearson’s co-efficient of skewness 
_ 3 (Mean - Median) 
n) S.D. 
“. Median is given by 
: 0-32 = 3(29 -6 — Median) 
5 6-5 
1 
Median = ae (0-32)(6-5) 


29-6~— (2-08) 


= 29-6-—0-69 = 28-91. 
(5) Mean = 29-6, Mode = 24:8 
Karl Pearson’s co-efficient of skewness 


= 0:32 
.. S.D. is given by 
29-6—24. 
0-32 = 29-6-24-8 
S.D. 
4-8 480 
ie. ~ §.D.= a Ee 


0-32 32, 
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2.6. Kurtosis 


Let there be two frequency distributions which have same variability as measured by . 


the standard deviation. Their frequency curve may not be equally flat at the top. The flatness 
of top of a frequency curve is measured relative to that of normal curve. This relative flatness 
is called kurtosis. 

Measure of kurtosis tells us the extent to which a frequency curve is more peaked or 


flat-topped than the normal curve. Kurtosis is measured by B,. For normal curve, value of 
By is 3. 
For curves which are more peaked than the normal curve, B, >3. Such curves are 


called leptokurtic and for curves which are more flat-topped than the normal curve, B, < 3. 


Such curves are called platykurtic. 
The normal curve itself is called mesokurtic. 


The greater is the value of B,, the more peaked the distribution is. 
Sometimes y. =f, —-3 is taken as a measure of kurtosis. 

For a normal distribution, y, = 0 

If y. >0, the curve is more peaked (i.e., leptokurtic) 

and if y. <0, the curve is more flat-topped (i.e., platykurtic). 


EXERCISES 


1. The following table gives the dist. of plots according to their sizes in a given region. 
_ Calculate the quartile deviation. (Size of the form is rounded to the nearest acre). 


Farm size No. of farms Farm size No. of farms 
(in sq. metres) (in sq. metres) 

0—40 394 161—200 169 
41—80 461 201—240 113 
81—120 391 241 and over 148 

121—160 334 


Also calcflate the quartile co-efficient of dispersion. 
[Ans. 50:96; 0°51] 
2. The following table gives the frequency dist. of 290 workers of a factory according to 
their average monthly income in 2000-01. 


Income group No. of workers Income group No. of workers 
Below 50 1 150—170 22 
50—70° 16 170—190 15 
70—90 39 190—210 15 
90—110 58 210—230 9 
110—130 60 230 and above 10 
130—150 46 


Locate the quartiles and hence calculate co-efficient of dispersion. 
[Ans. 95-78; 149- 24; 0-22] 
3. Calculate the S.D. of each of the following sequences of binomial co-efficients : 
(@) 1,5, 10,5, 1. 
(it) 1, 6, 15, 20, 15, 6, 1. 
(iii) 1, 10, 45, 120, 210, 252, 210, 120, 45, 10, 1. 
[Ans. 3-323; 6-958; 90-17] 


MEASURES OF DISPERSION AND SKE 


4. Calculate the second mome: 


' following dist. : 
x: 0 1 2 
f: 1 9 26 


5. Calculate the S.D., for the fc 


emplicyees of a big factory : 


Wage (in Rs.) 50— 
No. of employees 25¢ 
25-3 
110 


6. Calculate the S.D. of the foll 


Marks No. 0} 
More than 0 
More than 10 
More than 20 
More than 30 


7. Calculate the variance of the 


Marks No. 0j 
30—35 
35—40 
40—45 
45—50 


8. Calculate the variance of the 


Marks No. 0j 
10—14 
14—18 
18—22 
22—26 
26—30 
30—34 


9. Find the standard deviation a 


Wages No. 0, 
Up to Rs. 10 
Up to Rs. 20 
Up to Rs. 30 
Up to Rs. 40 1 


10. Find out (a) median co-effici: 


from the following data : 
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ve same variability as measured by . 


> equally flat at the top. The flatness 
‘normal curve. This relative flatness 


Tequency curve is more peaked or 


by B2. For normal curve, value of 


al curve, B, >3. Such curves are 


ped than the normal curve, B, <3. 


distribution is. 


tosis. 


>) 
atykurtic). 


ig to their sizes in a given region. 
‘rounded to the nearest acre). 


: size No. of farms 
netres) 

-200 169 
-240 113 

nd over 148 


[Ans. 50-96; 0-51] 
workers of a factory according to 


group No. of workers 


170 22 
190 15 
210 15 
230 9 
id above 10 
of dispersion. 


[Ans. 95-78; 149-24. 0-22] 
8 of binomial co-efficients : 


[Ans. 3-323; 6-958; 90-17] 
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4. Calculate the second moment about the mean and co-efficient of variation of the 


' following dist. : 


x 0 1 
f: 1 9 


2 3 
26 59 


5 
52 


7 8 
7 1 
[Ans. 1-98; 35-4%] 


5. Calculate the S.D., for the following data relating to the weekly wage dist. of 5000 
emplcyees of a big factory : 


Wage (in Rs.) 50—55 45—50 40—45 35—40 30—35 
No. of employees 250 300 400 450 800 
25-30 20-25 
1100 1700 
[Ans. 9-024] 
6. Calculate the S.D. of the following dist. : 
Marks No. of students Marks No. of students 
More than 0 100 More than 40 25 
More than 10 90 More than 50 15 
More than 20 75 More than 60 5 
More than 30 50 More than 70 0 
[Ans. 15-94] 
. Calculate the variance of the following dist : 
Marks No. of students Marks No. of students 
30—35 5 50—55 16 
35—40 7 55—60 12 
40—45 8 60—65 7 
45—50 20 65—70 5 
80 
[Ans. 82-44] 
. Calculate the variance of the following data : 
Marks No. of students Marks No. of students 
10—14 2 34—38 10 
14—18 4 38—42 8 
18—22 4 42—46 4 
22—26 8 46—S0 6 
26—30 12 50—54 2 
30—34 16 54—58 . 4 


[Ans. 110-15] 


. Find the standard deviation and co-efficient of variation from the following data : 


Wages No. of persons Wages No. of persons 
Up to Rs. 10 12 Up to Rs. 50 157 
Up to Rs. 20 30 Up to Rs. 60 202 
Up to Rs. 30 65 Up to Rs. 70 222 
Up to Rs. 40 107 Up to Rs. 80 230 


[Ans. 17:26; 42-69%] 


. Find out (a) median co-efficient of dispersion and (b) mean co-efficient of dispersion 


from the following data : 
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12. 


13. 


14, 


15, 


16. 


Age-group 

(in years) 
15—20 5 
20—25 9 
25—30 82 
30—35 58 


No. of men 


Age-group 
(in years) 
35—40 


MATHEMATICAL STATISTICS 


No. of men 


49 
28 
6 
3 


[Ans. 0-17; 0-16] 
11. Calculate co-efficient of variation of the marks of 40 students given below : 


Marks Students Marks Students 
80—-84 1 50—54 6 
75-79 1 45-—-49 6 
70—74 1 40—44 6 
65—69 4 35—39 3 
60—64 4 30-—34 0 
55—59 7 25—29 1 

40 


[Ans. 21-79%] 
A sample of 5 items is taken from the production of a firm. Length and weight of the 5 
items are given below: 
Length : 3 4 6 7 10 
Weight : 9 11 14 15 16 
By comparing the co-efficients of variation of two characters, conclude which of them 
is more variable. 

[Ans. Length] 

During the first 10 weeks of a session the marks of two students X and Y taking the 
course were : 
X: 58 59 60 54s 65 66 52 75 69 52 
Y: 56 87 89 78 71 73 84 65 65 46 
Which of the two is more consistent ? 


[Ans. X] 
The scores of two golfers for 24 rounds each are: 
A: 74 75 78 78 72 #77 79 78 81 76 72 #72 «77 
74 70 78 79 80 81 74 80 75 71 73 
B: 86 84 80 88 89 85 86 82 82 79 86 80 82 
76 86 89 87 83 80 88 86 81 84 87 
Which may be considered to be more consistent? 


[Ans. B] 
Goals scored by two teams A and B ina football season were as follows : 
No. of Goals scored in a match : 0 1 2 3 4 
No. of matches by A : 54 18 16 10 8 
No. of matches by B : 34 18 12 10 6 
Which team is more consistent and why? 

[Ans. B] 


The following table gives the dist. of house-holds according to size in two cities A 
and B. 


Size of house-hold City A City B 
1 24 14 
2 10 10 
3 12 12 
4 15 13 
5 13 14 
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6 
7 
8 


Derive a measure to study the 


17. The index numbers of prices 
follows : 
Month In 
c 
Jan. 
Feb. 
March 
April 
May 
June 
July 
August 
September 
October 
November 
December 
Which of the two shares do y 


18. Calculate Karl Pearson’s co-1 
Marks No. 0 
Above 0 

Above 10 

Above 20 

Above 30 


19. The table below gives ages of 
Age 
(in months) 
15—16 
17—18 
19—20 
21—22 
23—24 
25—26 
27—28 
29—30 
31—32 
33—34 
35—36 
37—38 
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roup . No. of men 
ars) 

40 49 

45 28 

50 6 

55 3 


[Ans. 0:17; 0:16] 
students given below: 


ks Students 
54 6 
49 6 
44 6 
39 3 
34 0 
29 1’ 
40_ 


[Ans. 21-79%] 
irm. Length and weight of the 5 


7 10 
15 16 
acters, conclude which of them 


[Ans. Length] 
vo students X and Y taking the 


52 75 69 52 
84 65 65 46 


[Ans. X] 
81 76 72 72 #77 
75 71 73 
82 79 86 80 82 
81 84 87 
[Ans. B] 
i were as follows : 
2 3 4 
16 10 8 
12 10 6 
[Ans. B] 


‘ording to size in two cities 4 


B 


6 
7 
8 
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10 : 11 

6 10 
10 16 
100 100 


Derive a measure to study the variability of the dist. 


[Ans. A : 59-66%; B : 51-47%] 


17. The index numbers of prices of cotton and jute shares in a particular year were as 


follows : 
Month 


Jan. 

Feb. 
March 
April 

May 

June 

July 
August 
September 
October 
November 
December 


Index no. of prices Index no. of prices 
of cotton shares of jute shares 
188 “131 
178 130 
173 130 
164 129 
172 129 
183 120 
184 127 
185 127 
211 130 
217 137 
232 140 
240 142 


Which of the two shares do you consider to be more variable in price ? 


[Ans. Cotton shares] 


18. Calculate Karl Pearson’s co-efficient of skewness from the following data : 


Marks 
Above 0 
Above 10 
Above 20 
Above 30 


Age 
(in months) 
15—16 
17—18 
19—20 
21—22 
23—24 
25—26 


No. of students Marks No. of students 
1500 Above 40 780 
1400 Above 50 700 
1000 Above 60 300 
780 Above 70 140 
Above 80 0 
(Ans. 0-995] 
19. The table below gives ages of children in two nursery schools. Compare their variability : 
No. of children No. of children 
school A school B 

— 1 

1 2 

2 2 

5 5 

7 10 

9 7 

8 6 

5 3 

3 3 

1 1 

1 — 
— 2 
42 42 


(Ans. Ages of children in school B are more variable, 
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20. Find the co-efficient of variation and Pearson’s co-efficient of skewness from the 


following data : 

Vous Price Index Vine Price Index 
No. of wheat No. of wheat 

1990 83 1995 126 

1991 87 1996 130 

1992 93 1997 118 

1993 100 1998 106 

1994 124 1999 104 


(Ans. 14-85; 0-396) 
21. Find Pearson’s measure of skewness for the data given below : 


ae No. of persons gt No. of persons 
70— 79-99 12. 110—119-99 50 
80— 89-99 18 120—129-99 45 
90— 99-99 35 130—139-99 20 
100—109-99 42 140—149-99 8 


(Ans. —5-719) 
22. Find the co-efficient of dispersion and Pearson’s co-efficient of skewness for the 
following data : 


Wage Wage 


‘ (in Rs.) _ No. of persons (in Rs.) No. of persons 
70— 80 12 110—120 50 
80— 90 18 120—130 45 
90—100 35 130—140 20 

100—110 42 140—150 8 


230 


(Ans. 0-16; -0:33) 
23. For data in Ex. 2 locate median, quartiles and hence co-efficient of skewness. 
(Ans. 0:08) 
24. Calculate the Pearson’s co-efficient of skewness for the following dist. of weights of 
boys : 


ee ; Frequency ps ae . Frequency 
20:5 — 23-5 17 29°5 — 32:5 194 
23:5 — 26°5 193 32°5 — 35-5 27 
26°5 —29'5 399 35:5 — 38-5 10 
(Ans. 0-07) 
25. The first three moments about the origin are 
i = sine, HW =F (n+ Qn +1) 


1 2 
f —n(n+l1 
H3 4 (n+l) 


Examine the skewness of the data. 
26. Let x be a random variable with mean p and variance g2. Show that E {(x-5)*} as 
f” of b, is minimized when b = up. 
27. Show that the co- efficient of skewness ranges from —1 to 1 
(Hint : use Q, <Q, <Q;). 
00 
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frequencies. 

Positive and Negative Attribt 
positive attributes and those denot. 
symbol includes only capital letters 
and if only Greek letters, a negative 
a. B is negative class. 

Symbol. A symbol ‘A.N.’ is use 

AN = (¢ 
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To find classes of order 2, cons 


the classes 
A\A 


MATHEMATICAL STATISTICS 


efficient of skewness from the 


s Price Index 
No. of wheat 

5 126 

6 130 

7 118 

8 106 

P) 104 

(Ans. 14-85; 0-396) 

below : 

ht 

) No. of persons 

9-99 50 

9-99 45 

9-99 20 

9-99 8 


(Ans. ~5-719) 
efficient of skewness for the 


) No. of persons 
20 50 
30 45 
40 20 
50 8 
230 
(Ans, 0-16; -0-33) 
-efficient of skewness. 
(Ans. 0-08) 


2 following dist. of weights of 


) Frequency 
a) 194 
Pe) 27 
he) 10 
(Ans. 0-07) 
t+1)(2n+1) 


2. Show that E {(x-b)"} as 


1 


(Hint : use Q, <Q, <Q;). 
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3.1. Introduction 

Attribute means quality or property. The capital letters A, B, C, .......- are used to denote 
the several attributes. An object or individual possessing the attribute A is termed simply A 
and the class of individual possessing A is termed as the class A. The Greek letters a, B, Y 
.... are used to denote the absence of attributes A, B, C, ...... respectively e.g., if A represents 
the attribute honesty, 0. represents dishonesty. The combination of attributes is represented 
by grouping the letters representing the attributes, e.g., if A represents deafness and B 
blindness, AB represents combination deafness and blindness. 

Any letter or combination of letters like A, AB by means of which the characters of the 
members of a class are specified are called class-symbol. 

Class-frequencies. The number of observations in any class is called the class- 
frequency. It is denoted by enclosing the corresponding class-symbol in brackets. e.g., (A) 
denotes the number of objects or individuals possessing the attribute A. 

Order of Classes and Class-frequencies. A class specified by r attributes is known as 
aclass of order r and its frequency as a frequency of rth order, e.g., A, AB are classes of first 
and second order and (A), (AB) are frequencies of order one and two respectively. 

The total frequency is denoted by N. 

Ultimate Class-frequencies. The frequency of highest order are called ultimate class- 
Srequencies. 

Positive and Negative Attributes. The attributes denoted by capitals are termed as 
positive attributes and those denoted by Greek letters as negative attributes. If a class- 
symbol includes only capital letters, the corresponding class is termed as a positive class 
and if only Greek letters, a negative class e.g., the class AB is positive class and the class 
a B is negative class. 

Symbol. A symbol ‘A.N.’ is used for the dichotomising N according to A and is written 

A.N = (A) 
which is the symbolic way of saying that if N is dichotomised according to A, class-frequency 
(A) is obtained. 

Condition of Consistence. The necessary and sufficient condition for the consistency 
of a set of independent class-frequencies is that no ultimate class-frequency is negative. 

Ex. 3-1. Show that if there are n attributes, the number of distinct classes is 2: 

Sol. There is only one class of order zero, If Aj, Ao, .... An be 1 attributes the possible 
classes of order one are 

Ay, Ag, wee Ags 01, 02, «-. On (O’s denoting the absence of A’s) which are 2n = "cy. 2 in 
number. 

To find classes of order 2, consider two attributes A; and A). These two attributes give 
the classes 

A\A, A\Q2, 0,42 and QQ, 
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of order two. These are 2” = 4 in number. Since out of n two attributes can be chosen in "c, 

ways, total number of classes of order two 
= "Cy a 2 

Similarly the number of classes of order 3 

= "C3 ci 2° 

and in general the number of classes of order r 
= "c,.2" 

Total number of classes 

= 14+" .24+%cy. 2 tt "cp. WH... + Mey 2" 
= (1+2)’=3". 

Ex. 3-2. A number of school-children were examined for the presence or absence of 
certain defects of which three chief descriptions were noted : A, development defect; B, 
nerve signs, C, low nutrition. Given the following ultimate frequencies, find the frequencies 
of the classes defined by the presence of the defects i.e., those involving the Roman letters. 
A, B, C but not the Greek letters a, B, y including the whole number of observations N. 


(ABC) 57 (aBC) 78 
(ABy) 281 (oBy) 670 
(ABC) . 86 (aBC) 65 
(ABy) 453 (ay) 8310 


Sol. N = Total no. of observations 
= Sum of frequencies given above = 10,000 
Now _ (A) = (AB) + AB) 
(AB) = (ABC) + (ABy) 
(AB) = (ABC) + (ABy) 
(A) = (ABC) + (ABy) + (ABC) + (ABy) 
57 + 281 + 86 + 453 = 877 


u 


Similarly (B) = (ABC) + (ABy) + (ABC) + (oBy) 
= 57+281+78 + 670 = 1,086 
(C) = (ABC) + (ABC) + (ABC) + (aBC) 
= 57+86+78 +65 = 286 
(AB) = (ABC) + (ABy) = 57 + 281 = 338 


(BC) = (ABC) + (aBC) = 57+ 78 = 135 
(AC) = (ABC) + (ABC) = 57+ 86 = 143 


Also (ABC) = 57. 
Ex, 3-3. From the frequencies given below find all the class-frequencies. 
N = 10,000, (A) = 877, (B) = 1086, (C) = 286 
(AB) = 338 (AC) = 143, (BC) = 135, (ABC) = 57 
Sol. Now (AB) = (ABC) + (ABy) 
AD (ABy) = (AB) ~(ABC) = 338 ~ 57 = 281 
Similarly (ABC) = —(ABC) + (AC) = 143 —- 57 = 86 
(ABC) = (BC)-(ABC) = 135 ~57 =78 
Now (ABy) = (4B) ~ (ABC) 


= (A)~(AB)~ (ABC) 

= 877 ~338 ~ 86 =453 
(aBC) = (BC) (ABC) =(C)— (BC)- (ABC) = 286 ~ 135 - 86 =6 
(aBy) = (a) - (BC) | 
-  =2{di) - (aB) ~ (oC) 

= {N= (A)} — {(B)- (AB)} = (0BO) 


THEORY AND ASSOCIATION OF AT 


(aBy) = 


Thus all the ultimate freque: 


obtained as in last example. 


Ex. 3.4. Show that A+ a= 


a.N=(a) and 1. N= N Deduce 


(oBy) - 
Sol. By def. A.N=(A) ar 
: A.N+a.N ? 


eo (A+0).N = 


At+a: 
me a: 
Similarly B : 
a (ay) = 


Ex. 3-5. Given that 


(A) : 
Show that (AB) = (8), (AR 
- Sol. (AB) : 
(AB) » 


Ex. 3-6. Given that (A) 


and also that (ABC) 
show that 2(ABC) 
Sol. (ABC) 
2(ABC) 


Ex. 3-7. Measurements are 
measurements of the husbands 
measurements, in 700 cases foi 
many cases will both measurem 

Sol. Let A and B denote the 


respectively. Then 
N 


(8) 
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a ee oN dae me on Oe 


ned for the presence or absence of 
noted : A, development defect; B, 
ite frequencies, find the frequencies 
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78 


000 


c) + (ABy) 

= 877 

C) + (aBy) 
= 1,086 

C) + (BC) 
86 

281 = 338 
- 78 = 135 

-86 = 143 


he class-frequencies. 


6, (C) = 286 
5; (ABC) = 57 
57 =281 

~ 57 =86 

57 =78 


C)— (ABC) = 286- 135-86 = 65 


)} = (aPC) 
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10,000 — 877 — 1086 + 338 — 65 = 8310 
(0B) — (ABC) 
= (B) - (AB) —(aBC) = 1086 — 338 — 78 = 670. 

Thus all the ultimate frequencies are known. From these remaining frequencies can be 
obtained as in last example. 

Ex. 3.4. Show that A + « = 1 where A, a and | are operators defined by A.N. = (A), 
a.N=(a) and 1. N= WN Deduce that 

(aBy) = N-(A)-(B)-(C) + (AB) + (AC) + (BC) - (ABC) 
Sol. By def. A. N = (A) anda. N=(q) 
: A.N+a.N = (A)+(Q=N 


(aBy) 


or (A+a).N=1.N 
, A+a=1 
i a = 1-A. 
Similarly B = 1-Bandy=1-C 


(aBy) = oBy.N=(1-A)(1-B)(1-OE).N 
= (1-A-B-C+AB+AC+ BC-ABC).N 
= 1.N-~A.N—-B.N-C.N+AB.N+ACN+BCN-ABC.N 
= N—-(A)-(B)-(©) + (AB) + (AC) + (BC) - (ABC). 
Ex. 3-5. Given that 


ul 


N 
(A) (a) = (8) = (8)= > 
Show that (AB) = (a), (AB) = (a8). 
- Sol. (AB) = AB.N=(1-«@) (1-8). 
= {1-a-B+aB}.N 
= N-(a)-(B) + (a) 
= N-2-= + (o8)=(08) 
(AB) = AB.N= {1-0} {1—B).N 
= {l1-a-B+aB}.N 
= N-(a)-(B) + (8) = (a8). 


Ex. 3-6. Given that (A) = (a) =(B)=(—)=(O=(= > 


and also that (ABC) = (aby) 
show that 2(ABC) = (AB)+(AC)+(BO)- > 
Sol. (ABC) = (aBy)=N-(A)-(B)—(C) + (AB) + (AC) + (BC) - (ABC) 


2(ABC) = (AB) +(AC)+ (BO) - >. 


Ex. 3-7. Measurements are made on a thousand husbands and a thousand wives. If the 
measurements of the husbands exceed the measurements of the wives in 800 cases for one 
measurements, in 700 cases for another and in 660 cases for both measurements, in how 
many cases will both measurements on the wife exceed the measurements on the husband ? 

Sol. Let A and B denote the husbands exceeding wives in first and second measurements 
respectively. Then 

N 


(a8) 


1000, (4) = 800, (B) = 700 and (AB) = 660 
oB.N = {1-A} {1-B}.N 
{1—A~B+AB}.N=N-(A)—(B)+(AB) 
= 1000-800 — 700 + 660 = 160. 
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Ex. 3-8. 100 children took three examinations, 40 passed the first, 39 passed the second 
and 48 passed the third. 10 passed all three, 21 failed all three, 9 passed the first two and 
failed the third, 19 failed the first two and passed the third. Find how many children passed 
at least two examinations. 

Show that for the question asked certain of the given frequencies are not necessary. 

Which are they ? 
Sol. Let A, B, C denote passing first, second and third examinations respectively. Then 
‘ N = 100, (A) = 40, (B) = 39, (C) = 48, (ABC) = 10 
(aBy) = 21, (ABY) =9 and (aBC) = 19 


Now (aBC) = aBC.N=(1- A) BCN=(BC-ABC).N 
= (BC) - (ABC) 
and (ABC) = ABCN=(1-0) BCN= {BC- aBC}.N 
= (BC) - (aBC) 
No. of children who passed at least two examinations 
= (ABC) + (ABC) + (ABy) + (ABC) 


= (BC) - (ABC) + (BC) - (aBC) + (ABy) + (ABC) 
= (C)-(aBC) + (ABy) = 48 - 19. +9 = 38. 
Evidently three frequencies have been used and hence others are not necessary. 
Ex. 3-9. Show that if A occurs in a larger proportion of the cases where B is than where 
B is not, then B will occur in a larger proportion of the cases where A is than where A is not. 
Sol. It is given that 


(4B) . (48) 
(B)  — B) 
and it is to be shown that 
(AB) , (a8) 
(A) (a) 
(8) , (AB) 
(B) (AB) 
(B) (AB) 

+ (B) > 1+ (AB) 
(B)+(B) _ (AB)+(AB) 
(B) (AB) 
NS fA 
(B) ~ (AB) 

Ng AB). 
(A) (AB) 
(A) +(@) (AB) +(aB) 
(A) (AB) 


From given 

1 
or 
or 
or 
or 
or Late Pe ee ee 

(AB) _ (aB) 


or pee 


> 
(A) (a) 
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Ex. 3-10. At a competitive examination at which 600 graduates appeared, boys 
outnumbered girls by 96. Those qualifying for interview exceeded in number those failing 
to qualify by 310. The number of science graduate boys interviewed was 300 while among 
the Art graduate girls there were 25 who failed to qualify for interview. All together there 
were only 135 Art graduates and 33 among them failed to qualify. Boys who failed to qualify 
numbered 18. Find out. 

(i) the number of boys who qualified for interview. 
(ii) the total number of science graduate boys appearing. 
(iii) the number of science graduate girls who qualified. 

Sol. Let A, B and C denote the attributes of being a boy, qualified for interview and 
science candidate. Then 

N= 600, (A) - (a) = 96, (B) - (B) =310, (ABC) = 300, (oBy) = 25, 

(y) = 135, (By) = 33 and (AB) = 18. 

Since N = (A) + (a) = 600 = (B) + (B) =(C) + (y) 

(A) = 348, (a) =252, (B) = 455, (B) = 145, (C) = 465; 

(i) (AB) = (A)-(AB) = 348 — 18 = 330 
(ii) (AC) = (ABC) + (ABC) = (ABC) + (AB) - (ABY) 
= (ABC) + (AB) - (By) + (oy) 
= 300+ 18-33+ 25 =310 
(iii) (aBC) = (BC) - (ABC) = {1-B}- (1-). N- (ABC) 
= N-(¥)- (B) + (By) - (ABC) 
= 600 - 135 - 145 + 33 - 300 = 53. 

Ex. 3-11. Jn a free vote in the house of commons, 600 members voted. 300 Government 
members representing English constituencies (including welsh) voted in favour of the motion. 
25 Opposition members representing Scottish constituencies voted against the motion. The 
Government majority among those who voted was 96. 135 of the members voting represented 
Scottish constituencies. 18 Government members voted against the motion. 102 Scottish 
members voted in favour of the motion. The motion was carried by 310 votes. Analyse the 
voting according to the nationality of the constituencies and party. 

Sol. Let-A, B, C denote the attributes of being Government members, voting for the 
motion and being members of English constituencies respectively. 

Then N= 600, (ABC) = 300, (ay) = 25, (A) — (a) = 96, (y) = 135, (AB) = 18, (By) = 102 
and (B) — (B) = 310. 

It is required to find all ultimate frequencies. 

Now N = (A)+(a) =600 =(B) + (B) 

ne (A) = 348, (ao) = 252, (B) = 455 and (B) = 145. 

(C) = N-(y) = 465. 
(AB) = AB.N=A(1-8).N=(A) - (AB) 
= 348 — 18 = 330 
(BC) = (B)-(By) = 455 — 102 = 353 
(aBy) = N-(A)-(B)-(©) + (AB) + (AC) + (BC) - (ABC) 


(AC) = (aBy) + (ABC) + (4) + (B) + (C) - (BC) - (AB) - N 
= 25 +300 +348 + 455 + 465 — 353 — 330 — 600 
= 310. 
Now (ABy) = (AB)-— (ABC) = 330-300 = 30 


(ABC) = (AC)-(ABO) = 310 - 300 = 10 

(aBC) = (BC) -(ABC) = 353 - 300 = 53 

(ABy) = (AB) - (ABO) = 18- 10=8 

(oBC) = {1-A}- {1-B} CN={C-AC- BC+ ABC}.N 
= (C)- (AC) - (BC) + (ABC) 
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465 — 310 — 353 + 300 = 102 
(B) — (AB) - (BC) + (ABC) 
= 455 — 330 — 353 + 300 = 72. 
Ex. 3-12. Prove that in case of two attributes A and B, the conditions of consistency are 
(i) (AB)20 (ii) (AB) S (A) (iii) (AB) S(B) sand (i) (AB)2 = (A)+(B)-N. 
Sol. The necessary and sufficient condition for the consistency is that no ultimate 
frequency is negative i.e, (AB) 20, (AB) 20, (a8) = 0 and (af) 2 0. 
Now (AB) = {AB}.N=A{1—B}.N=A.N— {AB}-N 
= (A) - (4B) 
(AB) 2 0 implies (A) = (AB) 
Similarly (a@B) = 0 implies (B) 2 (AB) 
Now © (aB) = {oB).N= {(1-A)(1-B)}.N 
= {1-A-B+AB}.N=N-(A)-(B)+ (AB) 
(aB) = 0 implies (AB) = (A) + (B) —N. 
Ex. 3-13. Prove that in case of three attributes A, B and C, the conditions of consistency 
are 


(aBy) 


(i) (ABC) = 0 


(iii) (ABC) 2 (AB) + (BC) - (B) (iv) (ABC) 2 (AQ) + (BC) -(C) 
(v) (ABC) s (AB) 


(vi) (ABC) s (AC) 
(vii) (ABC) < (BC) 


and (viii) (ABC) s (AB) + (AC) + (BC) -(A)- (8B) -(O+N 
Hence deduce 
(AB) + (AC) + (BC) 2 (A) +(B)+(C)-N 
(AB) + (AC) - (BC) § (A) 
(AB) - (AC) + (BC) < (B) 
~ (AB) + (AC) + (BC) (©) 
Sol. For consistency it is necessary and sufficient that all ultimate frequencies are non- 
negative i.e., 
(i) (ABC) 20 
(ii) (ABy) = Oie, {ABy}.N>0 
ie, {A(1-B)(1-O}.N20 
ie, {A-AB-AC+ ABC}.N 20 
ie, (A)—(AB)—-(AC) + (ABC) 2 0. 
ie, (ABC) 2 (AB) + (AC) -(A) 
Similarly (aBy) 2 0 and (aPC) = 0 implies (iii) and (iv). 
(v) (ABy) 20 
ie, {ABy}.N20 
ie, ABIA-C).N20 
ie, {AB-ABC}.N2>0 
ie, (AB)2 (ABC). 
Similarly (vi) and (vii) follow from (ABC) 2 0 and (aBC) 2 0. 
(viii) (aBy) = 0 ; 
te, {1—-A}- {1-B}- {1-C}.N20 
ie, {1-A—-B-C+AB+AC+ BC-ABC}.N20 
ie, N—(A)—(B)—(C) + (AB) + (AC) + (BC) - (ABC) 20 
ie, (ABC) $(AB) + (AC) + (BC) —-(A)-(B)-(O TN. 
From (i) and (viii) 
(AB) + (AC) + (BC) - (A) - (B)-(C)+ N20 
ie, (AB)+ (AC) + (BOC) 2 (A) + (B) +(O-N 


(ii) (ABC) 2 (AB) + (AC) - (A) 
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From (ii) and (vii) 

(AB) + (AC) -(A) S$ (BC 
ie, (AB)+(AC)-(BO)S(A 
Similarly from (iii) and (vi), (¢ 

(AB) + (BC) - (AC) $(B 
and (AC)+(BC)-(AB)s(C 

- Ex. 3-14. [fa report gives the 
there must be a misprint or mistake 
N= 1000, (A) =510, (8) = 4! 
Sol. Now (AB) +(AC)+ (6 
and (A) +(B) +(C)- 
(AB) + (AC) + (BC) <(4 
.”, Data is not consistent and | 
Ex. 3-15. /fin an urban distr 
years of age were returned as “occ 
widowed, what is the lowest propo: 
have been occupied ? 

Sol. Let A and B denote the att 
respectively. Then 

N= 1000, (A) = 817 and (B): 

’. (4B) 2 (4) + (B)-N=i 

. Lowest proportion per the 

occupied 


Ex. 3-16. A market investigat 
811 liked chocolates, 752 liked tofj 
toffee, 356 liked chocolates and bi 
liked all three. Show that this info1 
Sol. Let A, B and C denote t 
boiled sweets respectively. Then 
N= 1000, (A) =811, (B 
and (ABC) = 297. 
Now (aBy) = 
Information is incorrect 
Ex. 3-17. In a war between 
White; there are more armed Whit 
ammunition than unarmed White: 
Reds without ammunition than un 
Sol. Let A, B and C denote t 
ammunition. Then 
(A) <(@ 
and it is to be shown that 


Now (aBy) = 
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0 = 102 

4BC) 

0=72. 

3, the conditions of consistency are 
and (iv) (AB)= (A)+(B)-N. 
ie consistency is that no ultimate 
and (aB) 2 0. 

= A.N— {AB}-N 


-B)}.N 
N-(A)—(B) + (AB) 


nd C, the conditions of consistency 


BC) = (AB) + (AC)-(A) 
BC) 2 (AC) + (BC) -(O) 
BC) < (AC) 


(C)+N 


tall ultimate frequencies are non- 


tv). 


THEORY AND ASSOCIATION OF ATTRIBUTES 97 


From (ii) and (vii) 
(AB) + (AC) -(A) $ (BC) 
ie, (AB)+(AC)-(BC) S$ (A). 
Similarly from (iii) and (vi), (iv) and (v) 
(AB) + (BC) - (AC) $ (B) 
and (AC) +(BC)-(AB) < (©). 
Ex. 3-14. /fa report gives the following frequencies as actually observed, show that 
there must be a misprint or mistake of some sort: 
N= 1000, (A) =510, (B) = 490, (C) = 427, (4B) = 189 (AC) = 140, (BC) = 85 
Sol. Now (AB) + (AC) + (BC) = 189 + 140 + 85 = 414 
and (A) + (B) + (C)— N= 510 + 490 + 427 — 1000 = 427 
(AB) + (AC) + (BC) <(A) + (B) +(C)-N 
.. Data is not consistent and hence there must be a misprint or mistake of some sort. 
Ex. 3-15. [fin an urban district 817 per thousand of the women between 20 and 25 
years of age were returned as “occupied” at a census and 263 per thousand as married or 
widowed, what is the lowest proportion per thousand of the married or widowed that must 
have been occupied ? 
Sol. Let A and B denote the attributes of being occupied and being married or widowed 
respectively. Then 
N= 1000, (A) = 817 and (B) = 263 
(AB) 2 (A) + (B) - N= 817 + 263 — 1000 = 80 
.. Lowest proportion per thousand of the married or widowed that must have been 
occupied 


ae x 1000 = 304 
263 , 


Ex. 3-16. A market investigator returns the following data. Of 1000 people consultea, 
811 liked chocolates, 752 liked toffee and 418 liked boiled sweets; 570 liked chocolates and 
toffee, 356 liked chocolates and boiled sweets and 348 liked toffee and boiled sweets; 297 
liked all three. Show that this information as it stands must be incorrect. 
Sol. Let A, B and C denote the attributes of having liking for chocolates, toffee and 
boiled sweets respectively. Then 
N= 1000, (A) =811, (B)=752, (C)=418, (AB) =570, (AC) =356, (BC) =348 
and (ABC) = 297. 
Now (aBy) 


N- (A) -(B)-(C) + (AB) + (AC) + (BC) - (ABC) 
1000 — 811 — 752 — 418 + 570 + 356 + 348 — 297 
= _-4<0 
Information is incorrect. 

Ex. 3-17. Jn a war between White and Red forces there are more Red soldiers than 
White; there are more armed Whites than unarmed Reds : there are fewer armed Reds with 
ammunition than unarmed Whites without ammunition. Show that there are more armed 
Reds without ammunition than unarmed Whites with ammunition. 

Sol. Let A, B and C denote the attributes of being. White, armed and possessed with 
ammunition. Then 


Hl 


(A) < (a), (AB) > (a8), (ABC) < (4By) 
and it is to be shown that 
(aBy) > (ABC) 


Now (aBy) = (aB)-(aBC) 
> (0B) ~ (ABy) = (a) - (0.8) - (ABY) 
> (A) - (AB) — (ABy) = (AB) - (4By) = (ABO) 
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Ex. 3-18. Jf in a series of houses actually invaded by smallpox 70% of the inhabitants 
are attacked and 85% have been vaccinated, what is the lowest percentage of the vaccinated 
that must have been attacked ? 

Sol. Let A and B denote the attributes of being attacked and vaccinated. Then 

N= 100, (A)=70 and (B)=85- 
Now (AB) > (A) + (B) —N=70 + 85 — 100 = 55 
, Lowest percentage of the vaccinated that have been attacked 
= = x 100 = 65% (approx.) 

Ex. 3-19. [fall A’ are BS and all Bs are C%, show that all A’ are Cs. 

Sol. It is given that (AB) = (A), (BC) = (B) 
and it is to be shown that 

(AC) = (A) 
From Ex. 3- 13, (AB) — (AC) + (BC) $ (B) 
(A) - (AC) +(B) s (B) 


ba (A) < (AC) 
But (AC) < (A) 
(AC) = (A). 


Ex. 3-20. [fall A’ are BS and no B's are C’s, show that no A’ are Cs. 
Sol. It is given that (AB) = (A) and (BC) = 0 
and it is to be proved that (AC) = 0 
- Now from Ex. 3-13, (4B) + (AC) - (BC) s (A) 
(A) + (AC) s (4) 
(AC) < 0 
(AC) = 0 (. (AC) 20) 


N a 
Ex. 3-21. Given that (A) = (B) =(QC)= a and 80% of the A’s are Bs, 75% of A's are 


C%, find the limits to the percentage of B's that are Cs. 
Sol. It is given that 
(AB) = 0-8 (4) =04N 
(AC) = 0-75 (A) = 0-375 N 
and the limits of (BC) are to be obtained. From Ex. 3-13, 
(AB) + (AC) - (BC) ¢ (A) 
(BC) 2 (AB) + (AC) —- (A) = 0-4 N+ 0-375 N-0-5N 
.. (BC) 20-275 N= 0-55 (B) 
and ~ (AB)-(AC)+(BO) <(B) 
-. (BC) S(B) - (AB) + (AC) = 05 N-0-4N +0375 N 
(BC) < 0-475 N=0-95(B) 
(BC) 
5 <-—-— < ()- 
0:55 SB) 0-95 
Reqd. limits are 55% and 95%. 

Ex. 3-22. Among the adult population of a certain town 50% of the population are 
male, 60% are wage-earners and 50% are 45 years of age or over, 10% of the males are not 
wage earners and 40% of the males are under 45. Can you infer anything about what 
percentage of the population of 45 or over are wage-earners ? 

Sol. Let A, B, C denote the attributes of being male, wage-earner and 45 years old or 
more. Then 
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N= 100, (A) = 50, (B) =6C 


The limits of (BC) are to be 
Now (AB) = 
and (AC) = 
From Ex. 3-13, (AB) + (AC 
(BC) 2 (AB) + (AC) - 
.. (BC) 225 
and (AB) —(AC) + (BC) Ss 
J (BC) S(B)+(AC)-¢ 
(BC) < 45 
25 < (BC) 45, 
Percentage of the pop 


between = x 100 = 50% and- 


Ex. 3-23. (a) The following 
defects amongst a number of sch 
A = development defects, B 
N= 10,000, (A) = 877, (B) 
(AB) = 338, (BC) = 455. 
Show that some dull boys a 


- least must be so. 


(b) The following are the c 
N= 10,000, (A) = 682, (B) 
(AB) = 248, (BC) = 363. 
Show that some defectively 
must be so. 
Sol. (a) It is required to fir 
Now (aC) = (C) - (AC) 
Now (AC) + (4B) - (BC) s 


and — (AB) + (AC) + (BC) 
.. (AC) S$ (A) + (BC) -( 
. (AC) $994 

and (AC) < (C) + (AB) - ( 
. (AC) $672 


(aC) 2 (C) - 672 =7. 
(b) Left as an exercise. 
Ex. 3-24. Given that 50% c 
60), 80% non-able-bodied, 35% 
bodied and aged, find the great 
men. 
Sol. Let A, B and C denote 
Then N= 100, (A) =50, ( 
(AB) = 35, (AC) = 4: 
and the limits of (ABC) are to b 
From Ex. 3-13, (ABC) 
(ABC) 
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‘smallpox 70% of the inhabitants 
west percentage of the vaccinated 


ced and vaccinated. Then 


been attacked 


rox.) 


that all AS are CS. 


that no A’ are Cs. 


f the A’ are B's, 75% of A’ are 


5N 


SN 


wn 50% of the population are 
‘over, 10% of the males are not 


‘ou infer anything about what 
s? 


age-earner and 45 years old or ~ 
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N= 100, (A) =50, (B)=60, (C)=50, (AB) = a x50=5— 


40 
and (Ay) = —~ x 50=20- 


100 
The limits of (BC) are to be obtained 
Now (AB) = (A)- (4B) =50-5 = 45 
and (AC) = (A)- (Ay) = 50-20 = 30 


From Ex. 3-13, (AB) + (AC) - (BC) § (A) 
.. (BC) 2 (AB) + (AC) - (A) = 45 + 30-50 
(BC) 225 
and (AB) — (AC) + (BC) S$ (B) 
-. (BC) $(B) + (AC) - (AB) = 60 + 30-45 
(BC) < 45 
25s (BC) <45, 
Percentage of the population of 45 years old or more who are wage-earners lies 


between ee x 100 = 50% and a x 100 = 90%. 
50 50 


Ex. 3-23. (a) The following are the proportions of boys observed for certain classes of 
defects amongst a number of school-children. 

A = development defects, B = nerve signs, C = mental dullness. 

N= 10,000, (A) = 877, (B) = 1,086, (C) = 789 

(AB) = 338, (BC) = 455. 

Show that some dull boys do not exhibit development defects and state how many at 


- least must be so. 


(b) The following are the corresponding figures for girls : 
N= 10,000, (A) = 682, (B) = 850, (C) = 689 
(AB) = 248, (BC) = 363. 
Show that some defectively developed girls are not dull and state how many at least 
must be so. 
Sol. (a) It is required to find the lower limit of (aC) 
Now (aC) =(C)- (AC) 
Now (AC) + (AB) — (BO) § (A) 
and - (AB) + (AC) + (BC) $ (C) 
.. (AC) $(A) + (BO) - (AB) = 877 + 455 — 338 
J. (AC) $994 
and (AC) $ (C) + (AB) — (BC) = 789 + 338 — 455 
: (AC) $ 672 
(aC) = (C) — 672 = 789 — 672 = 117 
(b) Left as an exercise. 
_ Ex. 3-24. Given that 50% of the inmates of an institution are men, 60% c are aged (over 
60), 80% non-able-bodied, 35% aged men, 45% non-able-bodied men and 42% non-able- 
bodied and aged, find the greatest and least possible proportions of non-able-bodied aged 
men. 
Sol. Let A, B and C denote the attributes of being man, aged and non-able-bodied. 
Then N= 100, (4) =50, (8) = 60, (C) = 80. 
(AB) = 35, (AC) =45 and (BC) =42 
and the limits of (ABC) are to be obtained. 
From Ex. 3-13, (ABC) 2 (AB) + (AC) -(A) = 30 
, (ABC) 2 (AB) +(BC)-(B)=17 
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and (ABC) 2 (AC) +(BOC)-(C)=7 

.. (ABC) 2 30 satisfies all the three inequalities. 

Also (ABC) < (AB) = 35, (ABC) $ (AC) = 45, (ABC) $ (BC) = 42 
_ and (ABC) $ (AB) + (AC) + (BC) - (A) - (B) - (CE) + N= 32 

(ABC) s 32 
30s (ABC) 232. 

Ex. 3-25. 50% of the imports of barley into a country come from the Dominions : 80% 
of the total imports go to brewing; 75% of the imports are grown in Northern Hemisphere, 
‘80% of Northern-grown barley goes to brewing; 100% of foreign Southern-grown barley 
goes to stock-feeding. Show that the foreign Northern grown barley which goes to brewing 
cannot be less than 30% nor more than 50% of the total imports. (It is assumed that brewing 
and stock-feeding are the only two uses to which imported barley is put). 


Sol. Let A, B and C denote the attributes of the barley coming from dominions, being - 


used in brewing and growing in Northern Hemisphere respectively. Then 
N = 100, (A) =50, (8) =80, (C) =75 


8 
(BC) = ~~ x 75 = 60 and (cy) = (aBy) 


and it is to be shown that 


30 < (aBC)<50 
Now (aBy) = (cy) = (aBy) + (aBy) 
os (aBy) = 0 
or (aB)- (BC) = 0 ie, (AB) = (ABC) 
From Ex. 3-12, (0B) = (a) + (B)- N= 30 
and (0B) <(a) = 50 "* (a) =N-(A)= 50 


ae 30 < (aBC)< S50. 

Ex. 3-26. A penny is tossed three times and the results, heads and tails, noted. The 
process is continued until there are 100 sets of threes. In 69 cases heads fell first, in 49 cases 
heads fell second and in 53 cases heads fell third. In 33 cases heads fell both first and 
second and in 21 cases heads fell both second and third. Show that there must have been at 
least 5 occasions on which heads fell three times and that there could not have been more 
than 15 occasions on which tail fell three times, though there need not have been any. 

Sol. Let 4, B and C denote the attributes of getting head in first, second and third trial 
respectively. Then 

N= 100, (A) = 69, (B) = 49, (C) = 53, (AB) = 33, (BC) =21 

From Ex. 3-13, (ABC) 2 (AB) + (BC) — (B) =5 

(ay) < (0B) = oB.N = {1 - A} - {1 - B}.N=N-(A)-(B) + (AB) = 15 
and (oBy) s (By) = N-(B)- (©) + (BC) = 19 


(ay) < 15. 
N 
Ex. 3-27. Given that (A) = (B) =(C)= >. and that 
(AB) _ (AQ) _ 
N Ne? 
(BC) 
find what must be the greatest and least values of p in order that we may infer that v 


exceeds any given value, say q. 
Sol. From Ex. 3-13, (4B) + (BC) + (AC) 2 (A) + (B) + (C)-N 


N 
2Np + (BC) 2 7 
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(BOC) 1 

N 2 

and (AB) + (AC) - (BO) ¥ (A) 
2pN-(BC) < > 


(20) 
e 2 2p 


A 


Since is to exceed q, fror 


1 
Soered and 2 


Le, ps (1-2) and p 
From Ex. 3-12, (AB) $ (A) 

ps B 

; 2 

Since (AB) ¢ 0, 


p must lie between 0 and : 


Ex. 3-28. Show that if oe =x 
A 
and 2) 2) = a 


the value of neither x nor y can excee 


Sol. From Ex. 3-12,(BC) 2 (B 
and (AB) $< (4 


IV 


x 5x 


yours 


Ex. 3-29. Show that for n attrib 


(ABC...... M)2 {(A) +(B)+ (QC) 
Sol. From Ex. 3-12,(AB) 2 (A 


Replacing B by BC 

(ABC) 2 (A 
Also (BC) 2 (B 
: (ABC) 2 (A 


In general for m attributes 4, B, | 
(ABC ..... P)2 (A) + (B) + | 
Replacing P by RS and using 
(RS) 2 (R 
for (m + 1) attributes 
(ABC ...... RS) 2 (A) + (B)- 
If the inequality holds for 
the inequality holds for three attribute 
of attributes. 
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(BC) = 42 
=32 


me from the Dominions : 80% 
»wn in Northern Hemisphere; 
reign Southern-grown barley 
barley which goes to brewing 
ts. (It is assumed that brewing 
irley is put). 


yming from dominions, being — 


tively. Then 


"" (a) =N-(A)=50 


, heads and tails, noted. The 
ves heads fell first, in 49 cases 
ses heads fell both first and 
’ that there must have been at 
‘re could not have been more 
need not have been any. 

in first, second and third trial 


=21 


‘A) —(B) + (AB) = 15 


(BO) 
N 


that we may infer that 


C)-N 
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(BC) , 1 
—— 2S 1 
a 5 oP (1) 
and (AB) + (AC) - (BC) s (A) 
N 
2pN-(BC) s > 
AO ona ..Q) 
N 2 
C). 
Since is to exceed q, from (1) and (2) 
uF —2p2q and 2 a! 2 
2 Pp=q P 2 =4q 
ie, ps : (1-29) and p> (2q +1) . (3) 
From Ex. 3-12, (AB) $ (A) 
Pas 
. B ~ 2 
Since (AB) 4 0, p20 


p must lie between 0 and ; (1 — 2q) or between ; (1 + 2q) and 3 


Ex. 3-28. Show thati¢ O = x= 2) -O 
and (AB) = (AQ) _ (BO _ 
N NN > 


; 1 
the value of neither x nor y can exceed z 


Sol. From Ex. 3-12,(BC) 2 (B)+(C)-N ie, y25x-1 
and (AB) S$ (A) ie, yx, 


1 
2 5sx-lie, x <— 
x x-lie, x 4 


ysxs 


ala 


Ex. 3-29. Show that for n attributes A, B, C, ...M 
(ABC... M) 2 {(A) + (B) + (C) + .... + (M)} -(n- 1)N where N is the total frequency. 
Sol. From Ex. 3-12,(AB) = (A) +(B)-—N 
Replacing B by BC 
(ABC) 2 (A)+(BC)-N 
Also (BC) = (B)+(C-N 
(ABC) 2 (A) +(B)+(C)-2N 
In general for m attributes A, B, C, .... P, let 
(ABC..... P) 2 (A) + (B) + (C. +...+(P)-(m-1)N 
Replacing P by RS and using 
(RS) 2 (R)+(S)-N, 
for (m + 1) attributes 
(ABC ...... RS) 2 (A) + (B) + wees. +(S)-mN 
If the inequality holds for m attributes it also holds for (m + 1) attributes. Since 
the inequality holds for three attributes, it also holds for four and hence five and any number 
of attributes. 
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Ex. 3-30. /n a very hotly fought battle 70% at least of the combatants lost an eye, 75% 
at least lost an ear, 80% at least lost an arm and 85% at least lost a leg. How many at least 
must have lost all four ? 

Sol. Let A, B, C, D denote losing an eye, an ear, an arm and a leg respectively. Then V 
= 100, (A) 2 70, (B) 2 75, (C) 2 80 and (D) > 85. Now from Ex. 3-29. 

(ABCD) > (A)+(B)+(C)+(D)-3N 
= 70+75 + 80 + 85 — 300 
= 10 
10% at least have lost all four. 
3.2. Association of Attributes 

Independence. /f there is no relationship of any kind between two attributes A and B, 
it is expected to have the same proportion of As among B's as among not B’ i.e., Bs. Two 
such attributes are termed as independent and the criterion of independence of two attributes 
A and B is 

CAB) SAD) 
(B) (B) 

Association. /f A and B are not independent, these are related in some way or other, 

however, complicated. These are said to be positively associated or simply associated if 


(AB) > (a negatively associated if (AB) < (A)(B) 


In statistics A and B are said to be associated only if these appear together in a greater 
number of cases than is to be expected if these are independent. 


Complete Association. Two attributes are said to be completely associated if one of | 


them cannot occur without the other though ihe other may occur without the one i.e., all A’s 
are Bs or all B’ are A’s according as whether A’s or Bs are in minority. 

Complete Disassociation. /t may be taken either as the case when no A’ are B’; or 
the case when no a5 are Bs. 


Co-efficient of Association. /t is given by 
_ (AB) (oB)-(AB)(@B) 
(AB) (ap) +(AB)(a.B) 
Co-efficient of Colligation. /t is defined by 
_ ¥{(AB) (aB)} - ¥{(AB)(@B)} 
~ ¥{(AB) (@B)} + V(AB)(B)} 


Symbols (AB) = a 
= (AB) - (AB). 
Ex. 3-31. [fA and B be two independent attributes, prove that 
¢) 22) (2B) — Gy (@B)_(AB) Gy (AB) _ (OB) 
- (2) 6B) (a)° (A) (4) (a) 


and (iv) (AB)= a. 


Sol. Since A and B are independent, 
(4B) _ (4B) 
“@® ~ ®) ee 
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- (AB) 
(i) From (1), (AB) = 


each ratio = 


(8) 

(8) 

i le 

(ii) From (1), (B) = 

(AB) 

(B) 
(AB) 

(B) - (4B) 

(4B) 

(A) 

(4B) _ 
(iii) From (2), (B) 
(AB) 

(B)-(AB) 

(4B) 

(A) 


(iv) From (2), (AB) 


Ex. 3-32. Show, whether A ar 
associated in each of the followin; 
(i) N= 100, (A) = 47, (B) 

(ii) (A) = 490, (AB) = 294, 
(iii) (AB) = 256, (&B) = 768 
(A)(B) _ (4762) 

So) N 100 

Since (AB) > (AB)po, attributes 

(ii) N=(a) + (A) = 1060, | 
(A)(B) _ (4% 
1 


(AB)o = 


(AB) < (AB)o 
.. Attributes are negativel 
(iii) (A) =(AB) + (AB) =25 
(B) = (AB) + (0B) = 25 
(x) = (aB) + (aB) = 76 
N=(A)+ (a) = 304+! 
B 304 
ine An ) _ Bo 
A and B are | nee 
Ex. 3-33. The male populati 
lakhs and the total number of ma. 
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(AB) _ (B)_ (AB)+(aB) 


(i) From (1), — x = 


(AB) = (B)_— (4B) + (a) 


. (0B) 

each ratio = (aB) 

() _ (8) 

(B) (a) 

. (AB) _ (AB) 
(ii) From (1), (B) = (8) = 

(4B) _ (4) 

(1) Nn 

(4B) _ _(A) 
(8) (48) N-(A~ 

(4B) _ (ab) 

(4) (a) 

(iii) From (2), > = i 
(AB) __—_—(A) 
(B)-(AB) —- N-(A) 

(AB) _ (a8) 

(A) (a) 


(iv) From (2), (AB) 


(AB) 


(A) 


N 


(B) (B) 
(AB)+(AB) _ 
(B)+(B) 
or AB) . 
(a8) ~ (a) 
(AB) _ (A) 
(aB) (a) 
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(2) 


Ex. 3-32. Show, whether A and B are independent, positively associated or negatively 


associated in each of the following cases : 


(i) N= 100, (4) =47, (B) = 62, (AB) = 32. 
(ii) (A) = 490, (AB) =294, (a) = 570, (0B) = 380. - 
(iii) (AB) = 256, (aB) = 768, (AB) = 48, (@B) = 144. 


(A)(B) _ (47)(62) 


eee 100 


=29-14= 


(AB)o 


Since (AB) > (AB)o, attributes are positively associated. 
(ii) N=(a) + (A) = 1060, (B) = (AB) + (0B) = 674. 


(A)(B) _ (490)(674) 
(AB) = N= 1060 


(AB) < (AB)o 


bes Attributes are negatively associated. 
(iii) (A) = (AB) + (AB) = 256 + 48 = 304 
(B) = (AB) + (OB) = 256 + 768 = 1024 


() = (aB) + (aB) = 768 + 144 = 


912 


N=(A) + (a) = 304 + 912 = 1216 


(A)(B) _ (304) (1024) 


“Bis N 1216 


A and B are independent. 


= 256 = (AB) 


= 311-6 (nearly) 


Ex. 3-33. The male population of U.P. is 250 lakhs. The number of literate males is 20 
lakhs and the total number of male criminals is 26 thousands. The number of literate male 
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eS ES eee eek 


criminals is two thousands. Do you find any association between literary and criminality ? 
Sol. Let 4 and B denote the attributes of being literate and criminal respectively. Then 
(A) = 20 lakhs, (B) = 26 thousands = 0-26 lakhs. 
(AB) = 2 thousands = 0-02 lakhs and N = 250 lakhs 
(A)(B) _ (20)(0-26) 


(4B)o = 250 


= 0-0208 > (AB) 
.. A and B are negatively associated. 
Ex. 3.34. Show that 
j = | Be d deduce that Q = 0 when A and B are independ 
() O= (AB) (aB)+(AB)(oB) and deduce that Q = 0 when A and B are independent. 
(ii) Q=1 for complete association. 
(iii) Q=-1 for complete disassociation. 


(iv) Q= 7: where R is the co-efficient of colligation. 
‘Sol. (i) 5 = (AB)-(AB)o= (AB) - oe) 
NB = (AB) {(A) + (@)} - (A) {(AB) + (0B)} 
= (AB) {(aB) + (a8) ~ {(AB) + (AB)} (0B) 
= (AB) (aB) - (AB) (a) 
i ae SO kf a 
© * (AaB) (af) +(AB)(aB) 
If A and B are independent, 5=0 
4 Q=0 


(ii) If there is complete association, all As are B's or all BS are As according as 4 or 
B¥ are in minority, 


Either, (48) =0or (aB)=0 

ed Q=1. 

(iii) If there is complete disassociation, either no A’ are B's or no @’s and B’s. 
Either (4B) = 0 or (a6)=0 

i O=-1. 

(iv) Bydef, R= V¥{(4B)(oB)} — J{(4B)(aB)} 
7 V{(AB)(oB)} + f{(48)(0B)} 


1+R _  y(AB)(aB) 


1-R —— (AB)(aB) 
(+R)? 

(1-R)? __2R 
(+k) |, 1+R? 
(1-R)? 


“~ Note. From (i) Q > 0 for positive association 


< 0 for negative association 
; = 0 for independence. 
Ex. 3-35. [fA and B are independent, find the (AB), (A B), (&B) and (af). 


A)(B 
Sol. Since A and B are independent, (AB) = {o) 
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(AB) 


(0B) = 
(0B) = 


(B 
Ex. 3-36. Show that § =-~ 


7] 


Sol. 6 = 


Interchanging A and B 


Ex. 3-37. Show that 
(ABY + (a8) - (aB)’ - 


Sol. R.H.S. = {(A)-(a)} 


= [{(AB) + 

— {(AB) + ( 
= {(AB) ~ (a 
= {(AB) ~ (a 
= {(AB)- (a 
+2 {(aB) - 
= (ABY + (ql 
Ex. 3-38. Show that if 


(A 
(A 


be two aggregates corresponding 


(AB), — (AB)2 = (0.B)2 — (0B) 
Sol. (A) = 
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tween literary and criminality ? 
and criminal respectively. Then 


chs 


) 
) when A and B are independent. 


fon. 

(8) 

N 

AB) + (aB)} 

3) + (AB)} (0B) 


Bs are A according as A’s or 


: BS or no @’s and B’s. 


|, (0B) and (of). 
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(4B) = (4)-(4B)=(A) {1 = | ~ 
_ (ACB) 
N 


| 2 ; 
(aB) = (B)-(AB)=(B) {1-@t- ont ) 


(a8) = N-(A)-(B) + (4B) 
Lae 


= N—-(A)—-(B) + —— 


= @- 2 wy 


_ (BC) (AB) _ (4B) 
Ex. 3-36. Show that 6 = N 2 [an (B) 6) 2) 
Sol. 5 = (AB)- Oe . ul yee) (B)} 
= x [(AB) {(B) + (B)} — {(AB) + (AB)} (B)] 
1 


py (LAB) (B)- (AB) (8) 


- 0) re P 28) 
N |(B) 6) 
Interchanging A and B 
os LO) i a al 
N [(A) (a) 
Ex. 3-37. Show that 
(AB) + (a8)? - («BY — (AB) = [(A) - ()] [(B) - (B)] + 2B. 
Sol. R.H.S. = {(A)-(o)} {(B)-(B)} + 2N {caB) - ue 
= [{(AB) + (AB)} — {((aB) + (aB)}] [{(AB) + (0.B)} 

— {(AB) + (aB)}] + 2N (AB) - 2(A) (B) 

{(AB) - (aB)}? — {(AB) — (@B)}? + 2{(A) + (@)} (AB) - 2(A) (B) 
= {(AB)—(a)}? — {(4B) — (aB)}? + 2(A) {(AB) - (B)} + 2(@) (AB) 
= {(AB)—(a)}? — {(AB) — (0B)}? - 2 {(AB) + (AB)} (0B) 

+2 {(aB) + (aB)} (AB) 

= (ABY + (aBy - (AB) - (0B). 
Ex. 3-38. Show that if 
(AB), (&B),, (AB):, (a8): 
(AB), (0B)2, (AB)2, (0B)2 


be two aggregates corresponding to the same values of (A), (B), (a) and (8) 


(AB), — (AB)2 = (0B)2 — (&B); = (AB)2 - (AB): = (8): — (2B)2- 
Sol. — (A) = (AB), + (AB): = (AB) + (AB)2 
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(AB), — (AB) 


= (AB) - (AB), 1) 
(B) = (AB); + (0B); = (AB) + (0B), 
(AB); — (AB). = (OB): - (0B), (2) 
(a) = (0B), + (OB), = (GB) + (0B). 
(0B); —(&B)2 = (oB)2— (a), (3) 


Front (1), (2) and (3) result follows. 


Ex. 3-39. Investigate the association between eye colour of husband and eye-colour of 
wife from the data given below : 


Husbands with light eyes and wives with light eyes = 309 
Husbands with light eyes and wives with not light eyes =214 
Husbands with not light eyes and wives with light eyes = 132 


Husbands with not light eyes and wives with not light eyes = 119 

Sol. Let A, B denote the attributes of husbands with light eyes and wives with light eyes 
repectively. 

Then (AB) = 309, (4B) =214, (a8) = 132 and (oB) = =119. 


oe (AB)(aB)— (AB)(aB) Le (309)(119) — (214)(132) 


(AB)(aB)+ (AB)(aB)  (309)(119) + (214)(132) 
36771-28248 
~ 36771428248 ~ 913 


’. There seems to be positive association of small degree. Working out percentage : 
Percentages of light-eyed amongst the wives of light-eyed husbands = 
309 


2144309 * 100 = 59% and percentage of light-eyed amongst the wives of not light-eyed 
husbands = 7324119 x 100 = 53%. Comparison brings out that the association is small, so 


small that no stress can be laid on it as indicating anything but a fluctuation of sampling. 


Ex. 3-40. Investigate the association between darkness of eye colour in father and son 
from the following data : 


Father with dark eyes and sons with dark eyes =50 
Father with dark eyes and sons without dark eyes =79 
Father without dark eyes and sons with dark eyes = 89 


Father without dark eyes and sons without dark eyes = 782 
What would have been the frequency of ‘fathers with dark eyes and sons with dark eyes’ for 
the same total number, had there been complete independence ? 


Sol. Let A and B be the attributes of father and son to be with dark eyes respectively. 
Then 


(AB) = 50, (AB) = 79, (B) = 89 and (oP) = 782. 
_ (50)(782) - (79)(89) _ 32069 _ 
~ (50)(782) + (79)(89) 46131 


. There is a positive association of high degree between the darkness of eye colours in 
father and son. 


Now (A) = (AB) + (AB) = 50 +79 = 129 


(B) = (AB) + (aB) =50+ 89 = 139 
N = 50+79 + 89 + 782 = 1000 
_ (A) _ 0129)0139) _ 
(AB)o = N 1000 18 (approx.) 


Had 4 and B been independent. 


= 
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(AB) = (Al 

Ex. 3-41. The following data rel 
persons. You are required to calcula. 
and unemployment and interpret it. 

Illiterate unemployed 

Literate employed 

Illiterate employed 

Sol. Let A and B denote the attr 
Then 


(AB) = 
(AB) = 20 
a 
Q G 
= 0 


Thus there is a positive associati 
Thus in general literate person in un 

Ex. 3-42. From the following, f 

Total population = 16,264,000, 
7,623, number of bald-headed blind 

Sol. Let A and B denote the at 
Then 


(AB) = 2: 
C 

(AB)o = 

(AB) > & 


‘. There is positive associatio 

Ex. 3-43. Do you find any assoc 
the following data : 

Good-natured brothers and go. 

Good-natured brothers and sul 

Sullen-natured brothers and gc 

Sullen-natured brothers and su 

Sol. Let A and B denote the : 
respectively. Then 

(AB) = 1230, (AB) = 850, (a 


Q=" 


= ( 
There is positive associa 
Ex. 3-44. Can vaccination b. 
the data given below ? 
‘Of 1482 persons in a locality 
‘Of 1482 persons, 343 had be 
Sol. Let A and B denote the < 


Then 
N= 1482, (AB) =35, ( 
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A) 
- (AB) 

2) 
- (a8), 

G3) 


ur of husband and eye-colour of 


= 309 
; =214 
; = 132 
eyes =119 
at eyes and wives with light eyes 


-119, 
_ (309)(119) ~ (214)(132) 
(309) (119) + (214)(132) 


ree. Working out percentage : 
of light-eyed husbands = 


agst the wives of not light-eyed 
‘that the association is small, so 


out a fluctuation of sampling. 


‘of eye colour in father and son 


= 782 

es and sons with dark eyes’ for 
ice ? 
be with dark eyes respectively. 


32069 
46131 9! 


n the darkness of eye-colour in 
129 

139 

0 


18 (approx.) 
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(AB) = (AB)o= 18. 
Ex. 3-41. The following data relates to literacy and unemployment in a group of 500 
persons. You are required to calculate Yule’ co-efficient of association between literacy 
and unemployment and interpret it. 


Illiterate unemployed 220 
Literate employed 20 
180 


Illiterate employed 
Sol. Let A and B denote the attributes of being literate and unemployed respectively. 


Then 
(AB) = 500 -(220 + 20 + 180) = 80 
(AB) = 20 (aB) = 220 and (af) = 180 
o = VED Os) (80)(180) — (20)(220) 
(AB)(aB) + (AB)(@B) — (80) (180) +(20)(220) 
= 05. 
Thus there is a positive association of high degree between literacy and unemployment. 
Thus in general literate person in unemployed. 
Ex. 3-42. From the following, find whether blindness and baldness are associated. 
Total population = 16,264,000, number of bald-headed = 24,441, number of blind = 
7,623, number of bald-headed blind = 221. 
Sol. Let A and B denote the attributes of being bald-headed and blind respectively. 


Then 


(AB) = 221, (4) = 24,441, (B) = 7,623 and N= 16,264,000 

(A)(B) _ (24,441)(7,623) _ 
(4B)o = “yy "16,264,000 ~ 

fs (AB) > (AB)o 

‘. There is positive association between baldness and blindness. 

Ex. 3-43. Do you find any association between the tempers of brothers and sisters from 


the following data : 
Good-natured brothers and good-natured sisters = 1230 
Good-natured brothers and sullen-natured sisters = 850 
Sullen-natured brothers and good-natured sisters = 530 
Sullen-natured brothers and sullen-natured sisters = 980 
Sol. Let A and B denote the attributes of being good-natured for brother and sister 


respectively. Then 
(AB) = 1230, (AB) = 850, (.B) = 530 and (aB) = 980 
_ (AB)@B) - (AB)(aB) _ (1230)(980) — (850)(530) 
~ (AB)(aB) + (AB)(@B) — (1230) (980) + (850)(530) 
= 0-46. 
There is positive association. 
Ex. 3-44. Can vaccination be regarded as a preventive measure for small-pox from 
the data given below ? 
‘Of 1482 persons in a locality exposed to small pox, 368 in all were attacked’. 
‘Of 1482 persons, 343 had been vaccinated and of these only 35 were attacked’. 
Sol. Let A and B denote the attributes of being vaccinated and attacked respectively. 


Then 
N= 1482, (AB) = 35, (A) = 343, (B) = 368. 
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_ (A)(B) _ (343)(368) _ 
CS ~ oy 1482 
(AB) < (AB)o 
.. There is negative association and hence vaccination can be re 
measure for small-pox. 


Ex. 3-45. In an antimalarial campaign in a certain area, quinine was administered to 


812 persons out of a total population of 3248. The number of fever cases is shown below: 
Treatment 


garded as a preventive 


Fever No Fever 
Quinine . 20 792 
No quinine 220 2,216 


Discuss the usefulness of quinine in checking malaria. 
Sol. Let 4 and B correspond to fever and quinine. Then 
(AB) = 20, (&B) = 792, (AB) = 220 and (a8) = 2,216 


_ (20)(2216) — (792)(220) _ 
Q = (20)(2216) + (792)(220) 
.. There is a negative association of hi 
preventive malaria. 


Ex. 3-46. A group of 1000 fathers was studied and it was found that 12:9% had dark 
eyes. Among them the ratio of those having sons with dark eyes to those having sons with 


not dark eyes was 1 : 1-58. The number of cases where fathers and sons both did 


not have 
dark eyes was 782. Calculate a co-efficient of association between darkness of eye-colour 
in father and son. Give the Jrequ 


encies that would have been observed had there been 
completely no heredity. 


Sol. Let A and B denote the attributes of having dark eyes for father and son res 
Then 


gh degree. Hence quinine may be taken as 


pectively. 


Co x 1000 = 129, v=1000, 42) - 1 d = 799 
100 ahaa > (AB) 1-58 and (aB) = 


Now (AB) _ (AB) _ (AB)+(4B) (A) _ 129 


1 1-58 2-58 2:58 2.58 -° 
(AB) = 50 and (48) =79 
(B) = (AB) + (aB) = 79 + 782 = 861 
(B) = N-(B)= 1000-861 = 139 
(AB) + (aB) = 139 
(aB) = 89 


Now see Ex. 3-40, 


EXERCISES 


1. Ifa collection contains N items, each of which is characterized by one or more of the 
attributes A, B, C and D, show that with the usual notation 
(1) (ABCD) 2 (A) + (B)+(C)+ (D)-3N 
and (ii) (ABCD) = (ABD) +(A CD) — (AD) + (ADBy) 
where B and Y represent the characteristics of the absence of B and C respectively. 
- Three aptitude tests 4, B, C were given to 200 apprentice trainees. From amongst 
them 80 passed test 4, 78 passed test B and 96 passed the third test. While 20 passed 
all three tests, 42 failed all the three, 18 passed_4 and B but failed C and 38 failed A 
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and B but passed the third. Determine (i) how many trainees passed at least two of the 
three tests and (ii) whether the performances in tests A and B are associated. 
[Ans. 76, 0 = 0:3] 


. Inacollege 50% of the students are boys, 60% of the students are above 18 years, and 


80% receive scholarships, 35% of the students are boys above 18 years of age, 45% 
are boys receiving scholarships and 42% are above 18 years and receive scholarships. 
Determine the limits to the proportion of boys above 18 years who are in receipt of 
scholarships. [Ans. Lies between 30 and 32] 


. A study was made about the studying habits of the students of a certain university and 


the following summary is given at one place in the report. 

‘Of the students surveyed 75% were from well-to-do families, 55% were boys and 
60% were irregular in their studies. Out of the irregular one 50% were boys and two- 
thirds were from well-to-do families. The percentage of irregular boys from well-to- 
do families were 8. Is there any inconsistency in the data ? [Ans. Yes] 


. The following data relate to flexibility and selling ability of 20 salesmen. Test whether 


the two attributes are independent. 


Flexibility 
Selling ability Good Poor 
Good 7 3 
Poor - 2 8 


[Ans. Q = 0:8] 


. Calculate the co-efficient of association between illiteracy and criminality from the 


data given below and interpret it. 
The total population of a city is 244,000 out of which 40,000 are literates. The number 
of criminals in the group of literates is 300 and in the group of illiterates 4,000. 
[Ans. 0-5] 
The following data were observed for hybrids of Datura. 
Flowers violet, fruits prickly (AB) = 47 
Flowers violet, fruits smooth (AB) = 12 
Flowers white, fruits prickly (0B) = 21 
Flowers white, fruits smooth (a8) = 3 
Investigate the association between colour of flower and character of fruit. 
[Ans. (— 0:28)] 


. From the data given below, compare the association between literacy and 


unemployment in the rural and urban areas. 


Rural Urban 
Total number of adult males 25 lakhs 200 lakhs 
Literal males 10 lakhs 40 lakhs 
Unemployed 5 lakhs 4 lakhs 
Literate and unemployed males 3 lakhs 4 lakhs 


Sc GR Oa ot OO EE ERI 
. In an assortative mating study to find whether tall husbands tend to marry tall wives 


the following information about the wives of 125 tall and 125 short-statured husbands 
was published. 


Tall husbands Short husbands 
(percent) (percent) 
Tall wives 56 13 


Short wives 11 48 
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Find the co-efficient of association between the stature of wives and husbands, 
ignoring medium-sized wives. [Ans. 0-9] 


10. From the figures in the following table compare the association between literacy and 


11. 


unemployment in rural and urban areas. 


Urban Rural 
Total adult males 25 lakhs 20 lakhs 
Literal males 10 lakhs 10 lakhs 
Unemployed males : 5 lakhs 12 lakhs 
Literate and unemployed males 4 lakhs 4 lakhs 


In a state with a total population of 70,000 adults, 34,000 are males and out of a total 
of 6,000 graduates 700 are females. Out of 1200 graduate employees of the state, 200 
are females. Is there any sex bias in education among the people ? The state holds that 
no distinction is made in appointments in respect of sex. How far is their claim 
substantiated by the data given above ? 


12. A census revealed the following figures of the blind and the insane in two age groups 


13. 


14. 


in a certain population. 
2 SSS 


Age-group Age-group 

(15-25 years) (over 25 years) 

Total population 270,000 160,200 
No. of blind 1,000 2,000 
No. of insane 6,000 1,000 
No. of insane among the blind 19 9 


(a) Obtain a measure of the association between blindness and insanity in each of the 
two age group, (b) Do you consider that blindness and insanity are associated or 
disassociated with each other in the two age groups or more in one age group than the 
other ? 

Obtain the co-efficient of association between unemployment and educational 


attainments from the following results of an urban survey. 
Employed Unemployed 


Illiterate or below matric $997 432 
Matric and above §72 96 


[Ans 0-4] 


. The following table gives the number of literates and criminals in three cities of U.P. 
Se ee ae eee eee CLE 


Kanpur Allahabad Agra 


Total number (in thousands) 244 184 230 
Literates (in thousands) 30 47 33 
Literate criminals (in thousands) 3 2 =. 2 


Illiterate criminals (in thousands) 40 20 24 
aS ee 


Compare the degree of association between criminality and illiteracy in each of the 
above three cities. 


00 


Difference Ope 


4.1. Divided Differences 
Let the values of f(x) for x = x 


fix, x) i 
Kx, X15 X2) 7 
Ko; X1, X2; x3) = 


and so on, are called divided diffe 
Ex. 4-1. Compute the divide 


x : 1 
fix): a 
ok ee ee 
x fx) 1st order 
an2 
1 2 Bec 
8-4 
—_—— =4 
2 4 3_2 
16-8 
—— = 8 
3 4-3 
128-16 7 
4 16 7-4 
7 | 128 
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Difference Operators and Interpolation 


4.1. Divided Differences 


Let the values of f(x) for x = Xo, X1, «+ X, be known. Then 
f(x)- fo) 
Ko, 41) = sip 


f(x1,%2)— f(%0.*1) 
x2 —X9 
F(1,%2%3)- f(%05%1>%2) 
= X3- Xo 
and so on, are called divided differences of first order, second order, third order, etc. 
Ex. 4-1. Compute the divided differences of f(x) from the es table : 


Axo, X1, x2) 


S05 X15 X25 X3) 


x : 1 2 3 
Ix) : 2 4 8 16 128 
Sol. : 
x Sx) 1st order 2nd order 3rd order 4th order 
43 4-4 
1 02 ay =? eee 
2-1 l ‘ea 
8-4 8-4 i ee 53 _U 
| 2 4 ==" re aa 4-1 3 7-1 90 
| 112 22 
—-8 —--2 
| 16-8 3 220 3 16 
| F oe Wea 7-3. 3 7-2 15 
128-16 _ 112 
4 16 =ay 
7 128 
Ex. 4-2. Obtain the divided differences of fix) = x 
Sol. Let Xo, X1...... x, be the value of x. Then 
2 2 
fox)-fo) 7-3 
_ f)-fo) _ _ : 
S{%o, *1) rE oe 


fH; 42) - f0.%1) 


x2 —Xo 


i 


I%o, X41, X2) 


111 


112 
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SF (Xo, 1, X2, 43) = 


Sol. Let xo, x1, 


Then 


St (xo, XI, X2) ai 


_ (2 +%1)-(01 + x9) s 


x1 —Xo 


x2 —Xo 


I (%1,%2,%3)— f(X9,%1,%2) = 


1 


0 


X3—%0 
Evidently all higher order divided differences will be zero. 
Ex. 4-3. Prove that the divided differences are symmetrical in their arguments. 


sats X, be the arguments. 


FS (Xo; X1) = LOy-fE0) _ fe) , S (x9) 


x) —Xo 


I (1, X2)- f(x9,%1) 


X72 -XQ 


f(x2) 


~ (x2 —49)(22 =m) 


Let in general 


SF (%0; X15 Xm) = 


(Xm —X9)(Xm -x)) 


f(%m) 


eee 


at 4 | 


1 S(2) , fA) |_| fey) , £0) 
X72 —-Xo Xx -X] Xx -X2 x1 -Xo XQ - 1 


f(a) f (xo) 


(x1 -—x9)(%1-x2) (xq —x1)(x9 - x2) 


(Xm —Xm—1) 


f(%m-1) 


SC Car ae a ee 


I(x) 


St 


(Xo — 1 )(x9 —X2)..... (X9 -Xm) 


Then 


SF (Xo, X15 6 Xm+1) = 


F(X 1522 ++ Xmas) —S(X05%1 «+X pn) 


Xm+1 — Xo 


fms) 


1 
Xm+17X0 le —X1)(Xma1 —X2)--(Xm41 —Xm) 


f(Xm) 


(Xm —X1)(Xm =X ).0-(Xpy —Xm-1)(%m -Xm+1) 


ff ovecei 


+ 


tH eee 


f(*1) 


(x] —x2)....(%] — X41) 


ae | 
Xm+1 — Xo 


(Xm-1 -X0) 


+ 


f(%m) 


(Xm —XQ)--+-(Xyy —Xm-1) 


I(Xm-1) 


eee 


ieee (Xn 


f (xo) 


| —Xm_2)(Xm-1 —Xm) 


(Xo —X1)....(x9 =| 
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By induction, 


So, XY veee Xn) = 


(Xp 


(x9 

Evidently f(xo, x1, .... %.) rema 
JiXo, X1, «. Xn) is symmetrical in its 
Ex. 4-4. Show that divided difj 

of the divided differences of two fui 
Sol. Let f(x) and g(x) be two / 

A(x) = fx) +a 

h(x, 


TI 


itn 


Now Axo; X1...%n) = 


a 1 
7 IT (X_- 


ten 


+ —— 

I 

itn 

= f (Xo, %1 


Ex. 4-5. Show that the dividec 
the divided differences of f (x). 
Sol. Let 6 (x) = cf (x) 


_ 4 
ume 


ign 


Then (Xo, X1, ---. Xn) = 


MATHEMATICAL STATISTICS 


“4 
1x 
2) og 


zero. 
2trical in their arguments. 


(xq) 
07%] 


X1 —Xog Xo —x 


= fa) | feo) 


x1) 5 I(x) 


Mx1-%2) (Xo —x1)(x9 - x2) 


‘m-1) 
i-1) 
(%m—1 —Xm—-2)(%m—1 —Xm) 
(X09 Xm) 
Xm+1 — Xo 
mst) 


XQ )e (Xm -Xm) 


eter ed 
m1) (Xm -Xm+1) 


~Xm-1) 


\(Xm~1 —Xm) 
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= f(Xmu) 
(Xmit — XQ). (mat —Xm) 
f(%m) 
(Xm —X0)--++(%m —%m—1)(%m — X41) 


F(X) 


(Xo —%1).---%9 —X mat) 


By induction, 


_ f(%n) 
a (Xp —X0)-+--(%n —Xn-1) 
F(%n-1) 
+ : A vas 
(Xp-1 XQ) er (Xpat ~Xy-2)(%q-1 -Xn) 
f (x9) 


(x9 —%)-+(%9 Xn)” 
Evidently (xo, x1, .... X,) remains unchanged on interchanging the arguments. Hence 
JS{Xo, X1, «- Xn) is symmetrical in its arguments. 
Ex. 4-4, Show that divided differences of the sum of two functions are equal to the sum 
of the divided differences of two functions. 
Sol. Let f(x) and g(x) be two fs and 
A(x) = fix) + g(x) 


h h(Xp- 
Now A(xg; X10. Xn) f—4 _AGn) + — An) + “ae + __ (xo) 


I (Xp -x;) I] (Xy-1-%;) Il (xo —%;) 
itn ian~] i#0 
1 1 
“Ti {f (Xn) + g(%n)}+ Tacs) (fn) + 2(%n-1)} 
fen alla itn-l ced 
eee TI (xo -x) {f (xo + g(%o)} 
i#0 
opty =, ft aa XG) 
[] @.-* . [] G-a-2 sae | (x9 - %;) 
itn izn-l iz0 


ain) 8m) XO) 
TT Gn-%i) TT Gn-1-%3) IT (xo -*;) 
i#n itn-1 i#0 
= f (Xo, X41, + Xn) + B(X0, X1 Xn). 
Ex. 4-5. Show that the divided differences of ‘cf(x)’, where ‘c’ is constant, are ‘c’ times 
the divided differences of f (x). 
Sol. Let $ (x) = of) 


Oe) rd G0) 


Then $(%0, 15 +. %a)= FF (4n-%;) TL (p-1-*;) STS: - 2) 
1 i#0 


izn izn—- 
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enna ee SESS. 


fn), fn), 5 FR) 
TH Ga-%) TL Gn-1-¥4) IT (x9 -x;) 
iF 


itn t#n-1 


= Cf Xo, x1 -..- Xn). 
Ex. 4-6. hoi that nth order divided differences of x" are constant. 
Sol. Let f(x) =x" 
non é 
Kx, %1) = LO)= So) py sal vo eres xo tet xt! 
x1 ~X0 x1 —X9 
f(%1,%2)- f(%0,%1) 


x72 —Xo 


Axo, *1, X2) 


x2 —-X9 
n-2 n-3 n-2 
= (x3 +X> Xot+ eeoee + Xo ) 
+x (x3 SF 0 hese +x57*) + ov +x""? 


Thus if, f(x) =x", f(xo, x1) is a homogeneous function of degree (n — 1) in xo, X15; {Xo X1; 
X2) is a homogeneous function of degree (n — 2) in xo, x1, x2, and so on. Thus the operation 
of taking the divided difference lowers the degree by unity. Hence, finally, f(xo, x1, ... Xn) 
will be a homogeneous function of degree n —n = 0 i.e., a constant. 

Ex. 4-7. Prove that the third order divided difference with arguments a, b, c, d of the 


1 
function —is equal to — 
x 


abcd 
Sol. 
x f[@= - Ist order 2nd order 1 order 
1 1 
: a ~ ab 
1 i a 
b ~ be abc ~ abed 
1 1 l 
: c ~ ed bed 
1 
: d 


4.2. Descending and Ascending Differences 
Descending Differences. The first descending difference of f(x) is defined by 
Af(x) = f@+h-f@) 
where h is the increment in x. The operator ‘A’ is called descending or forward difference 
operator. The second, third, etc., differences are defined by A{Af{x)}. AIA {Af (x)}] etc. 


DIFFERENCE OPERATORS AND IN 


Operator E. The extensio 


Ef(x) = £@ 
Ascending Difference. Tt 
V(X) = FO 


The operator ‘V’ is called 
third, etc., differences are defin 
Central Differences. The 


5f(x) = i 


The operator ‘8’ is called ce 
are defined by 5{5 f(x)}, 5[5 £2 
Central Mean Operator. 


ah 

2 

Relations between Opera 

ee Le ee 

COV SE Ee ay 
(iii) A= —— 


~-V 
(iv) Saf? EF =AE 
Relation between divided | 


FOX ee R= 
Factorial Notation 

xm) = xf, 

xm) = (x 


Ex. 4-8. Given Up =3, U; 
Sol. Difference table is 


x U(x) r 

0 3 
1 12 ; 
6' 

2 81 
11 
3 200 
~10 

4 100 
-9 

5 8 

* APUp = 755. 


Ex. 4-9. Show that E=1 
Sol. By def. Aff) = A 
= (1 

where 1f(x) =f) 
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i og EON = 
1a) TE @o-a) 


i#0 
are constant. 


1 1 


aug 4 xg+...+x9 


degree (n * 1) in X0; X13 Sx, XI, 
, and so on. Thus the operation 
. Hence, finally, f(xo, x1, ... Xp) 
onstant. 

vith arguments a, b, c, d of the 


‘e of f (x) is defined by 


‘ending or forward difference 
{Afix)}. ATA {Af (x)}] ete. 
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Operator E. The extension or shift operator ‘E’ is defined by 


Ef (x) = f(x th) 
Ascending Difference. The first ascending difference of f(x) is defined by 


VF) = £@)-F@-A) 
The operator ‘V’ is called ascending or backward difference operator. The second, 
third, etc., differences are defined by V{Vf(x)}, V[V{Vf(x)}] etc. 
Central Differences. The first central difference of f(x) is defined by 


h h 
= fixt—|-flx-—- 
sro = s(2+4)-s(x-4) 
The operator ‘8’ is called central difference operator. The second, third, etc., differences 


are defined by 5{6 f(x)}, 5[5 {8 f(x)}] etc. 
Central Mean Operator. It is defined by 


m = 5 (EV? + By 


Relations between Operators. (i) A= E-1 


E A A 
(i) Vana t eet 

E E 1+A 
(i) Az 
Ul irene! 

-V 


(iv) Pe) = E'? ee FE? = AE? = VE}? 
Relation between divided differences and ordinary differences is : 
A" f(xo) 
I (Xo, xy soot Ba) = nth” 
Factorial Notation 
x0") = x(x —h)(x-2h).... (x-m-I1h) 
xO) = (x thy! (xt 2h)! .... (et mhy! 
Ex. 4-8. Given Uy =3, U; = 12, U, = 81, U3 = 200, Us = 100 and Us = 8. Find A°Up. 
Sol. Difference table is ; 


x U(x) A A? A? A‘ A’ 
0 3 
9 
1 12 60 
69 -10 
2 81 50 -259 
- 119 ~269 755 
3 200 ; -219 496 
-100 227 
4 100 ‘ 8 
-92 
5 8 
“, Up = 755. 


Ex. 4-9. Show thatE=1+ A. 
Sol. By def. Affx) = fixth)—-f&) = Ef) -SO) 
= (E-1)ftx) 
where 1 f(x) =f(x) 
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ee 
, A = E-1 or E=1+A 

Ex. 4-10. Show that E =e" where D denotes the derivative operator and deduce that 


» Ase _ 1, 


2 
Sol. By def, Ef{x) = fix +h) =flx) + hf’ (x) + + ff @+ ic 


1+Aze” or 


| ee eee } 
= —| Ay-—A*y+—A°y-—A’ y+..... 
(9 Dy = 5 [ay pate 5 yale 


Sol. From Ex. 4-10, “?=1+A 


() D = tog (1 +A) 


(i) eM = 1+ 


1 


1 1 
hD = ~ log (1-VaV+5V" +2 


ve ve 


Wi 
4 
N 
+ 
4 
+ 
4 
+ 
+ 


Ex. 4-12. Show that 
() At fx) + g(x)} = Aflx) + Ag(x) 
(i) E(x) + g(x)} = Eflx) + Eg(x) 
(iii) A{cf{x)} = cAf(x) 
(iv) E{of{x)} =cEf{x) 
(v) AEZEA 
(vi) A™ A"=A"A"™= A™*” 
(vii) BPE" SE'E"=E™" 
(wilt) Af fg} = flx + h) Ag(x) + g(x) Ax) 


DIFFERENCE OPERATORS AND INTER 


f(x)| _ gx) AF(x)- 
oe (Ant 8 a8 


Sol. (i) Af{fx) + 2(x)} 


(ii) E{ftx) = g(x} 
(iii) Aichlx)} 


(iv) Ete f(x)} 


(v) AEf\x) 

i AE 

(i) AMAT F(x) 

, Ama" 

Similarly A"A” 


(vii) E"E" f (x) 


(viii) A{f. g} 


ium it t mW &od moe oh hob done nod | 


(ix) A {42} = 


g(x) 


a a ae a a 


Ex. 4-13. [f (i) f(E) is a poly 


f(E)a* 


(ii) f(A) isa poiynomia 


Sol. (i) Let - f(é) 


(ii) Let f(A) 
, f(A) a 


Ex. 4-14. Show that 


—X 
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tive operator and deduce that | (ix) A {22 - 8) AM) FO) AR) 


g(x) g(x)g(x+h) 


Sol. (i) A{fx)+g@)} = {fx+h)+e(x+h)}- F(x) +2@)} 


" (x) + = {fix + h)-flx)} + {ee +h) - g@)} 
= Af(x) + Ag(x) 
Gi) Ex)t+e@)} = Axthstzgath= - Ef(x) + Eg(x) 
(iii) A{cf{x)} = f(x +h)- fx) 
= e{ f(x +h)—fx)} = cAfx) 
(iv) Exe f(x)} = of (x +h) = cEf{x) 
(v) AEfix) = Afftxt+ h)=fxt+ 2h) -flxth) 
= E{f(x+h)—f(x)} = EARx) 
ne AE = EA 
(vi) AA" f(x) = (AA... m times) (AA... n times) f(x) 
= (AA... (m+n) times) fx) = A"*" f(x) 
ber A™A" = A” +n 
Similarly A"A” = A™*" 
(vii) E"E" f(x) = (EE...m times) (EE...n times) 
= {EE...(m +n) times} f(x) = E" *" f(x) 
OE a Ee , 
(viii) A{f.g} = fx th). g(x +h) - fd). g(x) 
= fix + h) {g(x + h) - g(x)} + g(x) {Ax + h)- fo} 
= flx +h) Ag(x) + g(x) Ax) 
ro {A} _ L+H _ fR) _ x) fe+H-f@)g+h 
g(x) g(x+h) g(x) g(x) a(x +h) 
_ SOLC+)-fO)}-fO@)BE+h)-8)} 
Ax)gx+h) 
_ Sx) AS(x)- f(x) Agta) 
eer ee gixe(xth) | 
4 “= Ex. 4-13. [f(i) f(E) is a polynomials in E, show that 
f(a" = a’ f(a) 


(ii) f(A) is a polynomial in A, show that 
fAa = a f(a'- 1). 

Sol. (i) Let - f(E) = pot pi E+p Et... prE 

ss SEV = (pot pr Et py E+. + pn E") a 
pod + pia **+ pra? re Pe ali 
& {pot pia" + pra" + .... Py a} 
a f(a’) 
Pot py A? +... +p, A" 
(Pot pr At... + pad") a 
tbo a" + py (E= 1) a+ pr (E- 1)’ a +....+ pa (E~ 1)" a} 
a {pot pi (a"~ 1) + pz (a"- 1) +..... + Py ("= 1)"7] 
a f(a" —1). 


(ii) Let f(A) 
SAa& 


Ex. 4-14. Show that 
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Yo Ee * 
Sol. R.H.S. = E Mak 
én Abe) 59 thy SO 
2 MY (E! ) 7a A ( X+. ) — 
€ 
e* ce 
=e" e*)s Wee =e" =LHS. 


e = h 
Ex. 4-15. Show that A (tan™ x)= tan”! (Ao. where h is the interval of differencing. 
xh+x 


‘Sol. LHS. = tan! (x+ h) ~ tan! x 
. 1+x(x+h) 14+xh+x 


AZ 2 
Ex. 4-16. Explain the ailcenee between oven Ju and ae 
these functions when U, = x’. é EU, 


and find the values of 


ne 
Sol. (= E U, = Result of operating A? E™ on U, 


2 
and ( = = Ratio of the results of operating A? and E on U,. 


2 
E 
(x + hy’ - 2x3 + (x— A) = 6xh? 


Ax3 — (x+2h)>-2(x+hy+(x3) xh? 46h 6h? 


Ex (x+h)? ~ (xthP (x thy? 
Ex. 4-17. If f(x) = e™, show that f(x) and its leading eifferences are in G.P. 


il 


WE" (3°) = A? (x-h) = Af? - (x- A} 


Sol. f@ = & MG) =e) ~ = fe" 1} 
A? f(x) = Afe™ (e*~ 1)} =(e" — 1) Ae™ 
= (e*-17P &®, 
Similarly, AP f(x) = (e— 1)? & and so on. 


Evidently f(x), Af{x), A? f(x) .... are in G.P. 

Ex. 4-18. Show that (i) A"=E" —"c, E™!+"cy E”? |... + (-1)" 
(i) BE" SA" + "cy A"! + "cy ANP + FL. 

Sol. (i) By def., Af(x) = fe+h)— = fin= (E-1) Ax) 


A = E-1 

A? f(x) = fet 2h)-2f (x+ h) + fx) = (E -2E + 1) f(x) 
* A? = E?-2E+1 
Let, in general, N= EB -'c, BO'4+...4+(- 1G B+. + (AY 


Then A’*! = AA’=(E-1)(E'-'c, BO ' 4+... +I QE 
E'S ("ey +1) E+ ('e +'ey) B+ 


(-1)* Cog + "cy_1) Bt dy 


Ft + (-1)) 
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= ph le E 
wt ely 
", If the result hoids for n =r, itz 
hold for 1 = 2, hence it holds for n = 
(ii) left as an exercise. 
Ex. 4-19. /fy, = sin x, show thai 
Sol. A’ y, = (E 
= sin 
= 2s 
= kE 
where k=2(cosh—1). 
Ex. 4-20. Show that A" x" =7 
polynomial of degree ‘n’ is constant. 


Sol. Any" = A" 
= Al 
= A" 
+, 
= A" 
where a3, a4 etc., depend upon n and 
A" x ao A{ 
where bo is also constant depending 1 
A" x" = nf{ 

Now consider a polynomial 
f(x) = Ao 
Then A" f(x) = Ao 
= Ao 

Ex. 4-21. Show that A’ (1 - ax 
Sol. Let ox) = 
= ab 
A'° 6(x) = 10 
Ex, 4-22. If xo, 1, .... Xn be eque 
Al 
S{xo, x1, fet Hy) os 
Sol. Let Xie1- Xj; = A 
fi 
By def., fo, x1) = = 
fi 
Kxo, X15 2) a 
~ 9) 
A 


Let f (xo, Ryjeseree 


= 
a 
i 


Then (Xo, x1, cna Rip 1) = 
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a ee ae Sees ee 


AZ (on) eis 
“A? em 
e* =LHS. 


2re h is the interval of differencing. 


-1 h 
n ee 
1l+xh+x 


Ae 
aT and find the values of 


id Eon Uy. 


A{x-(x~A)?} 


6xh? 

*)  6xh? 46h? 6H? 
(x+h)> (x+hy* 

iferences are in G.P. 

=e™ {eo _ - 1} 

) Ae™ 


.+(-1)" 

x) 

t) =(E°- 2E + 1) f(x) 
"OnE 8+ 5+ (Ty 


cy E’~* +..4 (-1)’) 
+ 


DIFFERENCE OPERATORS AND INTERPOLATION 119 
CO es 
= 1 +4 r+i- arn ee eran te 
= E™ _r ro + Et! Lpitles gE’! ne es) ear Et ky 
ately 


’. Ifthe result hoids for n =r, it also holds for n =r + 1. But the result has been seen to 
hold for n = 2, hence it holds for n = 2 + 1 =3 and hence forn=3+1=4 and so on. 
(ii) left as an exercise. 
Ex. 4-19. If y, =sin x, show that A? x = kEy,, where k is some constant. 
Sol. A’ y, = (E-1) sinx =(E? - 2E + 1) sinx 
= sin (x + 2A) - 2sin (x + h) + sinx 
= 2sin(x +h) {cosh- 1} =ksin(x +h) 
= kE (sin x) 
where k=2(cosh-—1). 
Ex. 4-20. Show that A” x" = nth" and deduce that nth descending difference of a 
polynomial of degree ‘n’ is constant. 
Sol. AM x" = AN! AX =A"! (e+ A) -x"} 
AY Enh) + "cy WP xt + th 
A"? {nhdx"! + "cy WP Ax? +... + nh"! Ax} 
At? [nh{(x + eel =" +"c> h {(xt+ hy"? oe 
+..4¢0h" (x +h-x)] 
= A"? In(n- 1) x"7? + at hag et an] 
where a3, a4 etc., depend upon 7 and h. Proceeding likewise finally, 
A" x" = A{n(n-1)... 2h"! x + bo} 
where bo is also constant depending upon n and h. 
Ax" = n{n=1)..2K""' (x+h—x) = nln" 
Now consider a polynomial 
SQ) = Ax" + Axl +t Ay 
Then A" f(x) = Ay A" x"+ Ay Ane he An An 
= Aj A" x" = Ao nth" = constant. 
Ex. 4-21. Show that A'° (1 — ax) (1 — bx’) (1 - ex) (1 - dx) = abed 10! 
Sol. Let o(x) = (1 — ax) (1 — bx’) (1 - ex’) (1 - dr’) 
= abcd x'° + terms containing lower powers of x 
A’ $(x) = 10! abed (taking h = 1). 
Ex. 4-22. [f x0, X1, .... X, be equally spaced, show that 


S{Xoy X15 00 Xn) = 2 fio) 
nth 
Sol. Let Xjie1-X; = A(G=0, 1,..n-1) 
te h)- A 
os Hicedns La fe) . fea =flsa) , vir0) 
_ £%1.%2)- (osx) 1 a eee 
S{%o, ¥1, 2) = X_- Xx Oh é h 
l A* f(x) 
= pF BUM +H) S00} = 
Let FS (Xo, X41, «0 Xm) = Bio) 
mih 


Lise Rp) =F BGs 5X) 


Xm+1 ~%0 


Then f(x, *1, «...Xm4+1) = 
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ee A™ f(x) _ A” f(xo) 
(m+])h | mth™ mih™ 
1 
= Grape 4” (fro +H) - Fa) 
_ Am f(x0) 
(m+1)thntl 
", By induction, 
A" 
SlX0, X1 cee Xn) = A fro) for alln 2 1. 
nth 
; E-1 iY 
Ex. 4-23. Show that (i) Ve = (ii) VEZ A=EV. 


Sol. (i) By def, 


(it) 


Fy 
Ex. 4-24. Show that (i) (1+A)(1- ee =1 


sé 


Vflx) = f(x) ~f(x--) 
= fx)-E'! flx)=(1-E") fx) 


V 


VE 
EV 


nw 
= 
| 


ae 


(ii) AV= A-V 
() (+A) (1-V) = E(1-V) #£-EV=E-A=1 
(ii) AV=(E-1) VS EV-V2A- V. 


Ex, 4-28. Show that (i) A"x (oh) = (gmp ym®) 4-(m—nh) 


Sol. 


(ii) AGO = (Aye om + 

Ax) = (xt+ nyo) — ih) 
= th) (x) A)... 
x(x-A).... 
(mh) x™—"*) 


A? x!) = (mh) (mH) x2 = (my x 


Proceeding likewise finally 


(i) 


Ex. 4-26. Show that (i) Aab™ =(b* - 


Sol. 


A x") = [mal x m—nh) 
Left as an exercise. 
1) ab™ 
(ii) Ax = rx) 
(i) Aab™ = ab *) — ab™ = ab™ (b° - 1) 
(ii) See Ex. 4-25. 


Ex. 4-27. Show that 


(i) n(n 1) + (n= 1) (n= 2) +a #25 (nF (1) 


(ii) n(n —1) (n-2) + (n— 1) (n-2) (n-3) +... 43.2.1. =. (n +1) n(n-1) (n—-2) 
Sol. (i) Let = Ss 


n(n —1)+(n-1)(n- 2)+. snes +2.1 
n + (n- 1)? +420 


(x +h—- m-th)-x(x-A)... 
(x—_m=—2h) {(x+h)-(x-m-Ih)} 


(x—m-—Ih) 
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Now An® = 3n? 
1 
n2) = ra 
Changing n ton—1,n-2,.....3 
1 
Gals 
1 
Gata s 4 
1 
32 = = 
3 { 
1 
Adding S = rie 
ols 
3 
i, 
3 


(ii) Left as an exercise. 
Ex. 4-28. Express x—3x+1lin 
difference. 


Sol. Let o(x) = ae 
Then Ad(x) = a 
A°o(x) = 2" 
= 20 
and AP o(x) = 3° 
age 
a3 6 
ae 
a= 2 
a, = Ai 
a = O( 
(x) = ¢ 
Difference table of (x) is given 
x (x) 
0 1 
1 ~1 
2 3 
we 19 
: p(x) = 1 
A? (x) = 6 


Ex. 4.29. A third degree polyn 
(2, 1), and (3, — 2). Find the value a 
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a" fo 


mih™ 


xo + h) — f (%o)} 


-Az=1 


*) 


n+ nh) 


-1h)—x(x-h)... (x—m—Ih) 
—(x- m-1h)} 


+ 1)n(n-1) 


3.2.1. 7 (n +1) n(n-1) (n—2) 


Dee +21 
72) 
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ae als ee 
Now An® = 3n® 
1 
n® = 2 {n+ I-19} 
Changing n ton—1,n—2,.....3 


(1-1)? = ; {n — (n- 1%} 


(n= 2) = > (n(n 2) 


1 
32) = - (4) 3} 
1 
Adding S= a {(n+ 1) — 3} + 22) 
1 
= 3 {(n+1)n(n-1)-3.2.1}+2.1 


= 5 (ntl) m(n=1) 


(ii) Left as an exercise. 

Ex. 4-28. Express x° — 3x + 1 in the factorial notation and use it to obtain its second 
difference. 

Sol. Let O(x) = x -3xt+1leaytax+ ax + ayx®) 

Then Ad(x) = a, + 2) a x 4.3 g a3x' 2) 

A’o(x) = 2 ay +30) 9D a3 x) 
= 22) apt 32 a,x) 

and Ae o(x) = 3@ a3= 3°) a3 = 6a, 


1 
ag A’ $(0) 
1 2 
m2" 5 A’ o(0) 


a, = A¢(0) 
a = (0) 


1 1 
9) = (0) +.A6 (x7 + > A? G0) x + = A? GO 
Difference table of (x) is given below : 


— 


x (x) A . ee A’ 
: 

0 1 5 

1 ni 4 6 6 

2 3 16 12 

3 19 


(2) = 1= 2x 43,9 4, 
A? O(x) = 6+ 6x") = 6 + 6x. 
Ex. 4.29. A third degree polynomial f(x) is passed through the points (0, -1), (1, 1), 
(2, 1), and (3, — 2). Find the value at x = 1-2. 


122 MATHEMATICAL STATISTICS DIFFERENCE OPERATORS AND INTER 


Sol. Difference table of f(x) is Ex. 4-30. Show that 


x Ax) A x wk ¥ 52 
Fs Hf +y 
0 =] Yory = ii VO rT 3 
2 
1 1 6 23 “4 
2 | “3 “3 Sol. LHS. = 
3 2 : 


1 


Ax) = fO) + AKO) x + — eo) x4 2 A’ f(0) x - 


1 = 
mea ee 2x) — x2 = . x) 


= -1+2x-x(x- 1) = = xGe~ 1) (2) 


1 
je {-go-2)-1}+2]-1 Ex. 4-31. Show that 
Let B0(x) = — = ~ g(x) = go (x) (x—2)-1, xy ty tryst = 
82(x) = (= 1)gi () +2, 83 (x) = xgo(x) — 1 = fx). or poate oa 


ead ms _ 72:6 
go(1-2) = ~ =, 81 (12)= ( 5} 08) at 


5-48 


g2(12) = (02) (-28)42= 7 


5-48 
g3(1-2) = (1-2) (=) —~1=1-192 = f(x) 


The calculations are best carried out using the following computational scheme which 
is clearly related to synthetic division. [ 


= 89> (*-2) > (x-2) go = 


a) 
a-1g -—@-Ne 81 Ex. 4-32. Show that _ ; 
2 Vx = Vy-1 + Aye + A: 
82 xX > Xg2 
2035 Sol. R.H.S. = 
83 = fix) 


For the above question. 


Lia >) 0-4 
-- OS . ax rd pms 
a 3 


-1 2 
i MO a iene © 
3 3 = 
2) 
5-48 i 
> (1-2) > 2-192 
3 o Ex. 4-33. Show that 
eee? ab gtshs 
1-192= f(x) 2 7 


MATHEMATICAL STATISTICS 


A2 Ae 


—2 
=3 


? (0) x + : A? (0) x 


) 
Laie 2 
= xGx— 1) (x2) 


-t}2 | 


)(x-2)-1, 
83 (X) = x82(x) — 1 = f(x). 


92 = fix) 


ing computational scheme which 
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Ex. 4-30. Show that 


2 3 2 
Figen easy ees ne x. 2 
yor ys 7 + y2 Ti + ¥3 31 Fosse e fronayoo% A Pop 
x x? 2 
Sol. L.H.S. = eg eT he bias Yo 


= cjg et 


yay 
= etna {rennet + oho 


2 ) 
x 
= & {ve +xAyot oi A? yot -| =RHS. 


Ex. 4-31. Show that 


3 3 x x x 2 
xy; +x tx°y3t 005 cyto Ay te Wy te 
yy y2 ¥3 l-x?! (lx)? YI (ix) 1 
Sol. LHS. = (xE+ PE? +x°E* +...) yo 
gy ee ey 
1-xE°° 1-x-xa°? 
xE xA fe 
-Xx l-x 
2 
xE 
= ——i1+——A+t Ae + 
z 1l-x (ix)? | 
Sey ; Yi tz A? y, + 
ee 1 al y 1 (x) SI eveeee 
Ex. 4-32. Show that 
Vx = Yes t+ Ayy-2 + Ay, 3+..+ 4" nat De: 
7 t A A2 n-1 ‘ 
Sol. R.H.S. = E- a er Vy + N’ Yx-n 
= E'A4VtV tit Vy tA Ven 
1-Vv" 
= E" — Ly tA" yen 
1-v" 1-V"” A" 
( dy ta" x-n = Vx Vx 


= hl V"\yx + V" Ye = ye. 
Ex. 4-33. Show that 


13 : 
Av" — 2 APA + OA A. +m terms 


2.4 2.4.6 


123 
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where h = 1 
Sol. Since A’ x"=0 forr>m, 
13 3. 
LHS. = Av"— 1 a+ ait BS pte up to co 
2 2.4 2.4.6 
2 (s-jaraisa? aoe es 
A(L+ Ay)? x" = AE? x 
1\" 1\” 1\” 
= x--— = +— —_ —~ 
| rela laa ag Ca 
Ex. 4-34. Show that 
Vern = Yn te, Ayn +77 !cy A? yy_2 +7 * 73 AD yg tose 
: A 1 a 2 & 
Sol. R.H.S. = [1% a C2 ae C3 Bp che de Yn 
2 3 
= 1+x A G+) ie = sa Vn 
E 2! E 3! E 


A\-* 
—- (.- 2) Yn = E* Yn = Vote 


Ex. 4-35. Show that yy —Yyeii + Yr 2— Yass + 


eeees 


Le, tae, (LY A 1S A, 
Sol. RHS. = 5 SE‘ 2s) @ ale) Boys 


4 
1 bs -1/2 1 1 5 -1/2 
= 5(E+44] yet Ey fey) Yx 
1 J(£+1) Si 
= 7) 4 [ Et ve 


= (1-E+ EB -E+....yy 
= Ve — Vet 1 tyVxt2—Vetg t =L.H.S. 


i°3 13 13.5 
Ex. 4-36. Show that- - ~My) +— At yy 2- ———_ A® yy 3+ 
ow tnat” yx 8 Yx-1 8.16 yx-2 816.24 Yx-3 
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Sol. LHS. = 


Ex. 4-37. Show that yo +*c1 


Sol. R.HLS. 


Ex. 4-38. Show that 
A" Yx-n = Vx — "C1 Yx-1 
Sol. R.H.S. = 


Ex. 4-39. Show that 
Up + Uy +. F U, = 
Sol. L.H.S. : 
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3! 


+m 
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Sol. LHS. 


2 -1/2 2 -1 
1A 4E+(E-1) 
= /l+-—-— Vy =\ 
4E 4E 


= 2E" (1+ EY Ys 
= 2E'? {2+ A}' yy 


-1 
A 
= FE? (1 + >} Vx 


A 1 1 
= p2 144i g-is..| 
- { 2°4° 8 Ys 


Lf 
Avy 4 sonee 


: Ay aer 
xt> *t5 


=t05 
2 
= RHS. 


Ex. 4-37. Show that yo + *c, Ay; + cz A? yy +73 A’y3 + 


Sol. R.HLS 


Ex. 4-38. Show that 


= 2 4 
= yy t*ey A Yur t 72 A” Yea t oo 


2 4 
A 
= [iste Stephon 


= E* {E+A(E-1)}")x 
= E*{E-A+AE}* ys 
= E* {1+ AE} ys 


= E* (14%, AE +%q, WE’ +703 NE +... 


= yo t*e; Ay, + cz A’ y2 +c A’ y3t.... 
= LHS. 


A" Yen =Ju— "C1 Yaa + "C2 Yx—2F see tL)" Ven 
Sol. RHS. = (1—"c) E' +") E® ....) yx 


Ex. 4-39. Show that 


= (1-E')"y,=E" (E-1) "0x 
= BE" A"y, =A"Ven 


Up t U, + ue + Ug ="* bey Up t"* eg A Up +... A" Uo. 


Sol. LHS. = (1+ E+E’ +....+ E") Uo 
Et 1 
= “Ely Ue 


{(1+ A)"*!-1} Uo 


alas At+"* lo A? ee Amhy Uo 


/2 
Vx 


} yx 
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Ex. 4-40. Prove that 


Sol. 


n-] 
» U, x" 
0 


L.HLS. 


Ex. 4-41. Show that 


Sol. 


5{ Axa} 


R.HLS. 
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= +1 
= "*"e Up + "ey A Up + ms A" Up. 


= Mone Ui =x"Un Gee n 
1-x (1-x)? (AUp — x" AU;) 


x? 


+ 
(i-x)? 


(A? Up — x" A? UD) Fox 


= Ug txU,+xU2 +... +2! Uy 
= {14+ xE+P E+. x BU 
_ 1-x"E" 
1—xE Uo 
= (se Pp Uo 
1-x(1+A) 
- dex"e" [xa |" 
l-x 1-x Uo 
1-x"E" ik xA x2 a2 
— —— + 
1—x ea Gene 


1-x"E" 2 
= {u ee AUp + Z 


A? Ug +... 
l-x (po. 


: x 
- 1-x : (i-x)? {AUp — x” AU} 
2 


+ 


(1-x) {A* Up—x" A? Uj} +... 
= RHS. 
= pfx) dg(x) + Hex) f(x). 
th. 
E2+E 2 = mi 
= —— fee) [le -E ‘Jae 


+ es Ee g(x) [le - e] f(x) 
«Hole AeBHoleo$) ale 


+3 feed) eS) (e844) 
= eeS)(eo§) fF) e(-8 


DIFFERENCE OPERATORS AND IN1 


Ex. 4-42. Sum the series | 
Sol. U,, = nth term = (Qn - 
Difference table is 


x l 
1 
2 y 
3 li 
4 3¢ 
5 7. 
From Ex. 4-39. 
U, + U2 +0... Un 
S 


4.3. Interpolation Formulae 
(1) When the values of the 
are used : 


4,3-1. Newton’s divided diffe 
f(x) 


Derivation. Let f(x) be ¢ 
points xo, X1.... X, Which are no 


F(x, x0) 
Sf) 

_ £ Xo, *1) 
SG Xo) 


Similarly 
SF (x, Xo, X1) 
SF &, Xo Xt Xn—1) 


Multiplying eqs. by (x-. 
adding. 
F(x) 


where R 
This formula, due to Ne 
formula. When the values of, 
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ved A” Up. = 6{f(x) g(x)} = LHS. 
: Ex. 4-42. Sum the series 123+3°+5° +... upto n terms. 
Sol. U, = nth term = (2n -1)° 


‘AUp — x" AU,) Difference table is 
x Ur A A? iN 
2 5 H 
? Ue a. 1 1 Be 
se Ur | 2 a 98 a 48 
“YU 
0 i 3 125 218 120 48 
| : 
| 4 343 386 168 
5 729 
Io From Ex. 4-39. 
U, + U2 +.... U, = "Cy U; + "cy AU, +... +A"! U; 
i < Ss pete 564 ee 
2! 3! 
242 ; 4 MND (—2) (4-3) 48 
mero +. Uo 4! 
“*) = n{1+13 (n—1) +12(n?—30+2)+2(n? —6n? + 11n—6)} 
x2 5 = n{2n*—n} =n? (2n? - 1). 
t 
o* (1—x)? co tas 4.3. Interpolation Formulae 
| (1) When the values of the argument are not equidistant, the following two formulae 
\Up — x" AUy} “ are used : 
4,.3-1. Newton’s divided difference Formula 
Un} + | F(x) = f(%0) + (& Xo) F (Xo, x1) + (x — x0) (X — x1) f (Ro, 1, *2) 
++ (x —X9) (K-41) 0. Xn) S (XO, X1 Xn) 
Derivation. Let f(x) be a function which takes n values f (xo), f (x1) .... f Qn) at the 
points xo, x}.....x, which are not necessarily equidistant. Then by def 
f(x)-f(%0) 
fx) = 
I : x¥—X0 
2 &] 2(x) | F(%) = fo) + (% — x0) f (, x0) 
| F(x, x9)-F (20.1) 
f@ 9) = Ae 
X—Xy 
x at Sf (%, x0) = fXo, %1) + & — x1) F (, Xo, 1) 
2_ EF 2] f(x) Similarly 
Ff (x, Xo, 41) = f (Xo, X1, X2) + (& — X2) F(X, Xo, X1, X2) 
Hels : ;| Z e(x-4] F(%, Ko rye n= 1) = F (05 Ky 0 En) + (Aa) F (, Hon Hn) 
2 2 ‘ Multiplying eqs. by (x — Xo), (x — Xo) (% — *1), «.. & — Xo) @H— 41) eee (x — Xq_1) and 
; : adding. 
}} {r(x +3)- +-$)| F(x) = £0) + = 20) fo, 1) + (&— 40) — a) f Cn B12) 
2 2 +... + (X—X0) (4-41) «HH 1) Fos» Xn) +R 
h h | where R = (x—X9) (¥— 1) .... & — An) F(X, X05. Xn) 
2 & (x-4) : This formula, due to Newton, is called Newton’s. divided difference interpolation 


formula. When the values of f (x) for x = Xo, X1.... Xn are known the evaluation of f (x) is 


128 . MATHEMATICAL STATISTICS 
nL 


reduced to the problem of evaluation R. If it is known or negligible, the required value of f(x) 
can be calculated from above formula. In the case of a polynomial of nth degree, since (n + 
1)th order divided difference is zero, R = 0. 
‘ If f (x) is polynomial of nth degree. 
F(%) = £0) + & —X0) f (Xo, x1) + (& — Xo) (X — 41) f (Xo, 1, X2) 
F see (= XQ) (X= 4X1) oe Ha) F(X,» Xn) 
4.3-2. Lagrange’s Formula 


_ n n (x-x;) 
nays 2 ae I (x;-x,) 
j#i 


Drivation. Let f(x) be a function which takes value f (xo), f (x1),... f (Xn) for (n + 1) 
distinct points xo, x), .... X, and it is required to find a polynomial 
P(x) = agtayxt....ta,x" 
with the property that 
P(xi) = fd i=0, llowa 
The resulting polynomial is called Lagrange’s interpolation polynomial or formula. 
Evidently the unique polynomial P(x) (of degree < n) with the required property is 
P(x) = Lo(x) f(xo) + Li (x) f(x1) +... + Ly (x) f,,) 
where L; (x) is a polynomial of degree n in x with the property that 
L; (x) = 0 j#i 
= 1 ji 
Evidently Z; (x) has the form 
Lj(x) = A(x — Xo) &—¥1) ...  —¥j-1) (X—¥F 41)... (H—¥n) 
L; (x;) 


oe (xj — 0 ) (xj — 1)... (45 — Xj) (45 — X47 41) --- (4 — Xp) 
= 1 
II (x;-x;) 
j#0 
j#i 
nm (x-x;) 
L; = 
“ i (x; -x;) 
j#i 
Pa) = Ye T] S22 
izo [jo Cr -X) 
j#i 


Note. The polynomial P(x) is unique because if P(x) and Q(x) be two such polynomials, 
then {P(x) — O(x)}, a polynomial of degree < n, will have (n + 1) zeros Xo, x}, .... Xn Which is 
possible only when P(x) = Q(x). 

Ex. 4-43. Use Newton’ formula for unequal intervals to find f(8) from the following 
set of values : ; : 

x: 4 5 7 10 11 13 
Se): 2s 4 8 104 114 452 


DIFFERENCE OPERATORS AND IN 
DIFFERENCE OPERATORS AND” 


Sol. 
x Sx) 1st order 
4 2 
2 
5 4 
. 2 
7 8 
32 
10 104 
10 
11 114 
169 

13 452 _ 

: f(8) =2+( 
+(8- 
+(8- 

Ex. 4-44. Given the valu 

x: 4 § 

Kx): 48 10€ 
form the table of divided diffe 

Sol. 

x Tix) Ast orc 

4 48 52 
5 100 97 
7 294 202 

10 900 310 

11 1210 409 

13. 2028 


f(Q2) = 48+(2-4 

f(15) = 48+ (15 - 

+ (15-4) 

Ex. 4-45. The function 3 

1,2 and 4 respectively, Obtain 
value differs from 3° = 27. 
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le, the required value of f(x) 
ital of nth degree, since (n + 


:— Xo) (« — x1) f Xo, 1, X2) 
C— Xn) S Mo, «++ Xn) 


), SF 1),---f (Xn) for (n + 1) 
al 


i=0, lla 
1 polynomial or formula. 
the required property is 
+ Ln @)L,) 
hat 


1) — i+ 1). Hy) 
) 


be two such polynomials, 
‘CFOS Xo, X1, .... X, Which is 


d f(8) from the following 
1] 13 
114 452 
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Sol. 
Divided Difference Table 
x Sx) 1st order 2nd order 3rd order 4th order 5th order 
4 2 
2 
5 4 0 1 
5 
2 ~ ee 
23 = 5 
7 8 6 a5 .s 4 
32 — 
| 24 
10 104 = 
: 117 
10 —_— 
12 
11 114 53 
- 169 
13 452 — 


F(8) = 2+ (8-4) (2) + (8-4) (8-5) (0) 


—5 
+ (8— 4) (8-5) (8-7) (1) + (8 — 4) (8-5) (8~7) (8-10) (=) 


+ (8 — 4) (8-5) (8-7) (8 — 10) (8-11) (=) =« 


Ex. 4-44. Given the values 
‘x 4 a) 7 10 11 13 
Sx): 48 100 294 900 1210 2028 
form the table of divided differences and use it to obtain f(2) end f(15). 
Sol. 
Divided Difference Table 


x Sx) 1st order 2nd order 3rd order 4th order 5th order 
4 48 52 
5 100 97 15 1 
7 294 202 21 1 0 0 
10 900 310 27 ‘ 0 
11 1210 409 33 
13. 2028 


f(2) = 48 + (2-4) (52) + (2-4) (2-5) (15) + (2-4) (2-5) (2-7) (A) =4 
f(iS) = 48+ (15 —4) (52) + (15 - 4) (15 — 5) (15) 
+(15 — 4) (15 —5) (15-7) (1) = 3150. 
Ex. 4-45. The function 3* tables, as it should, the values 1,3, 9 and 81 when x equals 0, 
1,2 and 4 respectively, Obtain the value corresponding to x =3 and explain why the resulting 
value differs from 3° = 27. 
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Sol. ; 

Divided Difference Table 

x 3* lst order ‘2nd order 3rd order 

0 1 7 

1 3 6 2 ) 

2 9 36 10 

4 81 


Eee 
£(3) = 1+(3-0)2+ (3-0) 3-1) (2) + (3-0) GB - 1) (3-2) (2) =31 
Interpolating value differs from actual value because f(x) = 3* is not a polynomial. 
Ex. 4-46. The mode of a certain frequency curve y = f (x) is very near x = 9 and the 
values of the frequency density for x = 8-9, 9-0 and 9-3 are respectively equal to 0-30, 0-35 
and 0-25. Calculate the approximate value of the mode. 


Sol. 

Divided Difference Table 
x Six) 1st order 2nd order 
8-9 0-30 0:5 
9:0 0:35 0:33 — 2.08 
9:3 0:25 


J (x) = 0:30 + (x — 89) (0:5) + (x — 8-9) (x — 9-0) (-2:08) 
T(x) = Ne (2:08) {(x ~ 9:0) + (x - 8-9)} 
For modal value of x, /” (x) = 
0:5 — (2:08) {2x -— 17-9} = 
4:16x = i + 37:232 = 37-732 
= 9-07. 


Ex. 4-47. Use es s formula for interpolation to derive the form of the function 
y — (), given 


0 2 3 6 
f (x): 659 705 729 804 
Sol. Sx) * G=2)E—IE-$) E59) + (4-0)(x-3)@—6) (705) 


(0-2)(0-3)(0-6) 

2 (x-0)(x-—2)(x-6) 
(3-0)(3-2)(3-6) 
J 3,29 2 89 


(2-0)(2-3)(2-6) 


(729) + (x —0)(x-2)(x-3) 


(6—0)(6-2)(6~3) 8%) 


= xo +— x+ 659. 
72 72 4 
a 4-48. Use Lagrange’ formula to ‘ nd f (5) from the following data : 
2 3 6 i 
; a 1 5 is 61 125 


(S-3)(5—4)(5-6) (5-7) 
(2-3)(2—4)(2-6)(2-7) 
, G=25-3)(S-6)(5-7) 

(4-2)(4-3)(4-6)(4-7) 


(5—2)(5—4)(5-6)(5-7) 5 
(3-2)(3-4)(3-6)(3-7) (5) 
(5—2)(5-3)(5-4)(5-7) 
(6-2)(6—-3)(6-4)(6-7) 


Sol. (5) = (1) + 


(13) + (61) 
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(5-2)(5- 
(7-2)(7- 
Ex. 4-49. The following valu 
fC) =¢ 
Find the euslie of f (6) and al 
— (4=2)(x-7 
Sol. f(x) = (1-2)(0-7° 
(x~1)(x- 
(7-1)(7- 
1. 3 
= -=x°+=x 
6 2 
2 
fO*5 
fa) = 5x45 
Put (x) = 0 
a2 
cael fs 
1 


Since f” (x) =- A <0, f(x) i 
Ex. 4-50. The following tab. 


months of life: 
Age (in months) 0 
Weight (in lbs) 75 


Estimate the weight of the ba 


Sol. f(7) = 


Ex. 4-51. The observed val 
four positions 3,7, 9 and 10 of th 


_ give for the value of the function 
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ered airs ahatared Ale tenuis oictece 


1 order 3rd order 


3-1) (3-2) (2) =31 

= 3" is not a polynomial. 

‘) is very near x = 9 and the 
sectively equal to 0-30, 0-35 


39) (x ~ 90) (2-08) 
8-9)} 


ve the form of the function 


)(x-3)(x-6) 

)(2- 3)(2- 6) (705) 
~0)(x-2)(x-3) 
-0)(6—2)(6—3) 84) 


owing data : 
7 
125 
)(5-6)(5-7) 
1)(3-6)(3-7) 


3-3)(6-4)(6—7) eh) 
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eet 


(5-2)(5-3)(5-4)(5-6) 
(7-2)(7-3)(7-4)(7-6) 
Ex. 4-49. The following values of the function f (x) for values of x are given : 


FQ) = 4, f(2)=5, F(7Y=5, (8) = 4 
Find the value of f (6) and also the value of x for which f (x) is maximum. 


(x=2)(x-7(x-8) 4, (X-DE-DE-8) 


(125) = 28-6. 


Sol. £0) = “G=aya-nl=8)  * @=1)(2-7)(2-8) 
(x-I)(%-2)(%-8) 5 N (x-)(@-2)(x-7) , 
(7-1)(7-2)(7--8) (8-1)(8-2)(8-7) 
15 3.8 

= sx +ixge 
6 2 3 
17 
f= = 
f@) = 3x43 
Put f(x) = 
bse 


9 
Since i asa <0, f(x) is max forx=~,. 


Ex. 4-50. The following table gives the normal weights of babies during the first 12 


months of life: 
Age (in months) 0 2 5 8 10 12 
Weight (in lbs) i) 10-25 15 16 18 21 


Estimate the weight of the baby at the age of 7 months. 
7-2)(7-5)(7-8)(7—10)(7—12 
Sol IM = CR EEE 
eye je -ae a) e 
(7-0)(7-2)(7-8)(7—10)(7—12) 
(5-0)(5-2)(5-8)(5—10)(5—12) (1°) 
(7-0)(7-2)(7-5)(7—10)(7-12) 
(8-0)(8—2)(8-5)(8—10)(8—12) “19 
(7-0)(7-2)(7-5)(7—8)(7—12) 
(10—0)(10-2)(10—5)(10—8)(10—12) 
é (7-0)(7-2)(7-5)(7-8)(7—10) 
(12-0)(12-2)(12-5)(12—-8)(12 10) 
= 15-67, 


Ex. 4-51. The observed values of a function are respectively 168, 120, 72 and 63 at - 
four positions 3, 7,9 and 10 of the independent variable. What is the best estimate you can 


(7°5) 


(18) 


(21) 


. give for the value of the function at the position 6 of the independent variable ? 
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(6-7)(6-9)(6-10) 


Sol. JO a naa 

(6-3)(6—9)(6—10) (6-3)(6-7)(6—10) 

(7-37-9710) 129 * (9=3)(9-7)(9-10) 

(72) 
(6-3)(6-7)(6~9) 
(10-3)(10-7(10-9) °°) 
= 147, 

Ex. 4-52. Given the following table, find log, 656. 
x 654 658 659 661 
f()=logiox —: 2:8156 2-8182 2-8189 2-8202 
Ze 556) < (£582 958)(656—659)(656-661) |, 5 
as 4(656) = (654 —658)(654—659)(654—661) (“8199 


(656 ~- 654) (656 - 659) (656 ~ 661) 2.8182 
(658—654)(658-659)(658 - 661) ( ) 
" (656 - 654) (656 —658)(656- 661) 
’ (659 -654)(659 - 658)(659 — 661) 
(656 — 654) (656 — 658) (656 — 659) 92-8202 
(661-654)(661-658) (661-659) ( ) 
= 2-81681 =2 -8168. 
Ex. 4-53, Four equidistant values U_,, Uo, U,, and U; being given, a value is interpolated 
by Lagrange’s formula. Show that it may be written in the form. 
yy x(x? 


2 
D A? U_,+ 200) A? Uo 
3! 3! 


(28189) 


U,=yUo +xU, + 
where x+y = 1. 


2 
So. RHS. = (1-x) Up + xu + SANE (E-1) U4. 


+ 


x(x? —-1) 
aera (E-1)° Up. 


y 
fae) =) =U, 40) 


(1 —x) Up + xU; + 


2 ; 
+ 2 (Uz —2U, + Uo) 


Me 2 = 
= aoe? Ua+ ufc - Lr-nya-a2 =| 
2 —_ 
+U, {s + aes - 5" -»} + Uy eee | 
= x(x-1)(x-2) (x +1)(x-1)(x-2) 7 
_ (1-0) 1-1)(-1-2) " 4+1)(0-1)(0-2) ~° 


DIFFERENCE OPERATORS AND INTER 


(x+1)( 

(1+1)( 

Ex. 4-54. Given log 100 = 2 
2-0170, find fog 102. 


Sol. log 102 = ° 


(2) When the values of the arg 
4.3-3 Newton’s Forward Interpc 


ff) =. 
It is used to interpolate near tl 
Derivation. Lety=f(x) bea fi 


values Xo, X) «...- x, of x. Let 
I(x) = 


where the co-efficients ao, @), .... d 
for x = Xo, X1, «... Xn respectively. S 
Xj~~-Xo = 
oes xXj-Xo = 
Now I(xo) = 
Ke) = 


a, = 


I(%2) = 


a = 


Similarly, a3 = 


= 
Na) 
Ul 


Let U = 


Then 
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68) 


(6-3)(6-7)(6-10) 


(120) + (973-7 (9=10) 


) 
5) (63) 


659 661 
2:8189 2°8202 


656-661) 
654-661) (7°8156) 
9) (656-661) 
)(658—661) 
3)(656~661) 
3)(659—661) 
:) (656-659) 
')(661 - 659) 


(2-8182) 
(2-8189) 
(2-8202) 


ing given, a value is interpolated 
orm. 


A? Up 


at (E-1)P U4. 


2 
+ — (E-1) Uo. 


) (CU, 20+ Us) 


+ 


2 
| 
o—) (U —2U; + Up) 


2 
yx(-a)(x—2)+ 22) 


2_ 
oh ey, P=} 


+1)(x-1)(x-2) 
+1)(0-1)(0-2) Uo 
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ee 


(x+1)(x-0)(x-2) (x+1)(x-0)(x-1) 

(1+1)(1-0)(1-2) “1” (24+1)(2-0)(2-1) 

Ex. 4-54. Given log 100=2, log 101 =2-0043, Jog 103 = 2-0128, log 104 = 
2-0170, find fog 102. 


U, = U,;. 


(102 -101)(102 -103)(102-104) 
(100 —101)(100-103)(100—104) 

(102 -100)(102 -103)(102-104) 

(101-100)(101—103)(101—104) 
(102 - 100)(102 -101)(102 —104) 
(103 ~100)(103—101)(103~104) 
102 -100)(102 -101)(102 ~103 
SOA TIDOIONT TOR aIes anKe) 


= 2:0086. 
(2) When the values of the argument are equidistant the following formulae are used : 


Sol. log 102 = (2) 


(2-0043) 


(2-0128) 


4.3-3  Newton’s Forward Interpolation Formula 


2 n 
FQ) = yor U" Ayo + u A? yo +... + UY) ate 

It is used to interpolate near the beginning of the table. 

Derivation. Let y =/(x) be a function which assumes the values yo, ); ... y, for equidistant 
values Xo, x) ..... X, of x. Let 

I(x) = ag + ay (X— Xo) + a2 (x —X0) (Xx) 
Fee Fee + Oy (X = X0) (K-21) 0 M1) 

where the co-efficients ao, a1, .... @, are to be determined s.1./(x) takes the values yo, y1....Vn 
for x = Xo, X}, .... X, respectively. Since the values x; are equidistant, 


Xy—-X9 = XQ—-Xy =o... =Xn— Xn = h (say) 
Pe Xi-X) = th i=lwun 
Now I(xo) = ao0=yo 
Ke) = a9+ a, (1 —x0) = 
YN-yo _ 1 
a = = = 7 Ayo 


I(%2) = ao t+ ay (X2 — Xo) + a2 (%2 —X0) (X2 — 41) = V2 


1 1 
= i 2, +9) =—s 
a= 33 {y2 v1 yo} Ae 4 %O 


Similarly, 


(x-x9)(x-x1) 

ane 
trek (x-x9)(4-%}) .--. (Xn) any, 
nih" 


oe Sp Fw 
. kh 
(x= x9)(4=2).--@—%n) 


Then An 
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eG) Sy 


U(U~1)....(U~n=1) = U 


1 u® 
K(x) = yo U9 byt ZO ayy at A" yo. 


Ex. 4-55. Given the following pairs of corresponding values of x and y. 


x: 20 25 30 35 
y: 73 198 573 1198 
Find the estimated value of y Sor x= 22. 
Sol. Difference Table 


Un = 73+ (0-4) (125) + or 
= 93, 
Ex. 4.56. Given sin 45° = 0-7071, sin 50° = 0-7660, sin 55° = 0-8192 sin 60° = 0-8660. 
Find sin 48°. 
Sol. 


(250) 


Difference Table 


0:7071 + (0-6) (0-0589) + ) + OE O-8) 0-0057) 


w 
=} 
Lb 
oo 
f) 
ll 


% (0-6)(—0-4)(—1-4) 


51 (- 0-0007) 


0-7431. 


Ex. 4-57. Find the number of men getting wages between Rs. 10 and Rs. 15 from the 
following table : 


Wages per week (in Rs.) 0-10 10-20 20 — 30 30-40 
Frequency 9 30 35 42 


DIFFERENCE OPERATORS AND INTE 
aati acne 


Sol. Rewriting data in cumt 


No. of persons 
getting less than 


’. No. of men getting less tl 


9 + (0-5) (30) + 


+ 


‘. No. of men getting betw: 
Ex. 4-58. Use Newton ’ forn 
from the table given below : 
Age Annual 
20 0 
24 C 
Sol. 


Premiu 


‘. Premium at age 25 = 0- 


(1-25)(0-25) 
2! 
= 0-01625. 
Ex. 4-59. The following 
examination. 
Not more than... 


40 
45 
50 
55 


(0-( 
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A’ yo Raced o 


=i c i) = (250) 


in 55° = 0-8192 sin 60° = 0-8660. 


, (0-6)(-0-4) 
2! 


(— 0-0057) 


(- 0-0007) 


veen Rs. 10 and Rs. 15 from the 


20 - 30 30-40 
35 42 
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Sol. Rewriting data in cumulative frequency form and taking differences : 
Difference Table 


No. of persons 
getting less than 


*, No. of men getting less than Rs. 15 are 
(0-5)(-0:5) 
rT (5) 
(0-5)(-0-5)(-1-5) 
* 3! 
*, No. of men getting between Rs. 10 and Rs. 15. 
= 24-9=15, 
Ex. 4-58. Use Newton s formula for interpolation to find annual net premium at age 25 


9 + (0-5) (30) + 


(2) = 23-5 = 24. 


from the table given below : 


Age Annual Net premium Age Annual Net Premium — 
20 0-01427 28 0:01772 
24 0-01581 32 0-01996 
Sol. 
Difference Table 


eT Premium i Be 


0-01427 
0-00037 
~ 0-00004 
0-00033 


0-00154 


0-01581 


0-00191 


0-01772 
0-00224 


0-:01996 


*, Premium at age 25 = 0-01427 + (1:25) (0-00154) 
1-25)(0-25 1-25)(0-25)(-0-75 
+ fee?) (0-00037) + cee (— 0-00004) 
= 0-:01625. 
Ex. 4-59. The following are the marks obtained by 492 candidates in a certain 
examination. 
Not more than.... Candidates Not more than... Candidates 
40 212 60 460 
45 296 65 481 
50 368 70 490 


55 429 75 492 - 
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Find out the number of candidates who secured more than 42 but not more than 45 


marks. 
Sol. 
Difference Table 
x Freq. A D Ng A‘ Ng Ae A’ 
40 212 
84 
45 296 -12 
72 1 
50 368 -11 -20 
61 -19 59 ; 
55 429 -30 39 -—120 
31 20 -61 210. 
60 460 -10 —22 90 
21 -2 29 
65 481 -12 7 
9 5 
70 490 , ~7 
75 492 
42-40 
. = = 0-4 
", Number of candidates getting marks less than 42 
0-4) (-0-6 0-4) (-0:6) (-1-6 
= 212 + (0-4) (94) + CVS) “ ) ene oet. )( rT ds ) (1) 
0-4) (-0-6) (-1-6) (-2-6 
+ OD AO CHO 59 
0-4) (-0-6) (—1-6) (-2-6) (-3-6 
+ (0:4) (0-6) ( a )¢ ) (59) 
0-4) (-0-6) (-1-6) (-2-6) (-3-6) (-4-6 
+ PMA 2Y3-9(4-O) — J999 
0-4) (-0-6) (-1-6) (—2-6) (—3-6) (—4-6) (-5-6 
+ PME) 1-9) 2-9 3-4-9) 5-9) 940 
= 256 (approx.) 
.. Number of candidates getting marks more than 42 but not more than 45 = 296 — 256 
= 40. 
Ex. 4-60. Find f(0-0477) from the following data : 
x ; 0 0:05 0-10 0-15 0-20 
I(x) 1-00000 0-99750 099005 0-97775 0-96079 
Sol. 
Difference Table 
x f(x) A A? B IN 
0-00 1-00000 
— 0-00250 
0-05 0-99750 — 0:00495 
— 0-00745 0-00010 
0:10 0-99005 — 0:00485 0-00009 
— 001230 - 0-00019 . 
0-15 0:97775 -0- 
; — 0-:01696 
0-20 0-96079 


DIFFERENCE OPERATORS AND INTI 
pe ata a ace i ee 


U = 
f(0-0477) = 
Ex. 4-61. Given 
10 
Dd f@) = 
1 
10. 
> f@) = 
7 
find f(1). 

Sol. Let Sy = 
Then 
er a 

x Sx 
ae 
1 50042 
4 32924 
7 17521 
10 4036 

U 

for 5, x 
S2 

ie S; = 82 
=> f(Q) 


4.3-4. Newton’s Backward I) 


f@)=Int UY Vyn 


It is used to interpolate ne 
Derivation. Let y=f(x) bi 
values Xo, X1, «++: Xn Of x. Let 
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? than 42 but not more than 45 


59 : 

61 —120 510 
90 

29 


) (-0-6) (-1-6) 
So 


(- 120) 


=) (210) 


not more than 45 = 296 — 256 


= 40. 
0-15 0-20 
5 0-97775 0-96079 
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0-0477-0 
7 gos 
f(0-:0477) = 1-00000 + (0-954) (— 0:00250) 
0-954)(-0-046 
+ a (-0:00495) 
0-954)(-0-046)(-1-046 
4 Que (0-00010) 
0-954)(-0-046)( -1-046)(-2-046 
+ OE 20) IC M ) (0-00009) 
= 0:9977240 = 0-99772. 
Ex. 4-61. Given 
10 10 
> f(x) = 500424, > f(x) =329240 
1 4 
10 
> F(x) = 175212 and f(10) = 40365, 
7 
find f(A). 10 ; 
Sol. Let ss = > ft), 185x810 
Then : 
x Sy ASx A’Sx A’Sx 
1 500424 
-171184 
4 329240 17156 
— 154028 2025 
7 175212 19181 
— 134847 
10 40365 
a i 
: 1 
for 5», x = 2 andhence U= 3 
1 
S. = 500424 + 5 (- 171184) 
SIGS) anise «3 GS) 
SAN 37 (17156) + BAA 3 2A 3 7 (2025) 
2! 3! 
= S;—58842-55 
os S,;-S_ = 58842-55 
=> F(1) = 58842-55. 
4.3-4. Newton’s Backward Interpolation Formula 


u™ 


n! 


y®) 
f(x) = Yn + UY Von + a Vy_ + oe + V"y/ 


It is used to interpolate near the end of the table. 
Derivation. Let y= f(x) be a function which assumes the values yo, y}...y, for equidistant 
values Xo, X1, ...; X, of x. Let 
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K(x) = ao + ay (X—X_) + a2 (X —Xpq) (X—Xp_1) F oF Oy (X—Xp) (XK — Xp) :.-. (X— 2X1) where 
the co-efficients ao, aj,..... 2, are to be determined s.t. I(x) takes the values yo, y; .... ¥x for 
X = Xo, X1,... X, respectively. 


Let XxX, —Xo = X2—-X = =X_,—Xn-1=h 
z xj-X0 = th i=1,..0 
New Ip) = 40=Yn 


In-1) = agta, (Xn-1 — Xn) 


1 1 
a, = y (Wan =F Vyn 


T(%n-2) = Go + ay (Xn-2 —Xn) + Az (Xn-2 — Xn) (Xn-2 —Xn-1) 


1 1 
G2 = 5 yt (Ya Wott Ye2k = Vn 

re 1 3 1 n 

Similarly a3 = Te Vo Vy yerere Qn = rT V" Yn 
(x-Xn) ,2= Xn)(x- Xn-1) 2 
= Vv el 
I(x) Yat nt rhe Vry 
(x~-Xq).. (X= 21) oy 
rae nth” we 
X-Xy ; 
Let U = a ie, x=X,+ Uh 
Then (x—x,) ....(x—%1) = h” UU41)......(Ut 0-1) 
= ku” 
where U”) = UUt1) (U+n-1) 
(2) (n) 
I(x) s Yat UO Vy, + Vy, + ecnee Ph F V" yn 
n 
where U” = UU+1)....(U+ n=l). 

Ex. 4-62. Estimate the population in 1995 of a place having the following record. 
Year 1961 1971 1981 1991 2001 
Population 
(in thousands) 46 66 81 93 101 
Sol. Since 1975 is near the end of the table. Newton’s backward formula will be used. 


Difference Table 


_ 1995-2001 


1981 
1991 


(—0-6)(0-4) 
2! 


", Population in 1995 = 101 + (— 0-6) (8) + (- 4) 


Ese ian 


DIFFERENCE OPERATORS AND INTERI 
erect a Ra See SSE 


pie 0- ue 4)(1- ye 

= ee GaeOe 

43-5. Central Difference Formul 
(a) F(x) = yor Upd x 


+ Septic gaeeaioananind 
4! 

U(U? - 

Pi lac 


(6) f@) = Wi ty 


1 

where V = U-> 
(©) fe) = yt UY 
(2r- 


It is used when U is negative 
1) 
(fe) = wot U 


(U+r 
C 


x 


It is used when U is positive 
Central difference formulae 


Ex. 4-63. Find the value of 


Rate per cent An 
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—Xn) (X —Xpy_1) «.... (X— x1) Where 
takes the values yo, yj .... y, for 


=h 


n-2 ~ Xn) %n-2 - Xn-1) 


Xn\(*X-Xn-1) G2 
aneSSCY 


Di 


aving the following record. 
1991 2001 


93 101 
ackward formula will be used. 


1995_— 
y= 1995-2001 
=e 10 


! 
? (-4) 
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4 €9-6)0-4)01-4) 
3! 4! 


(-0-6)(0-4)(1-4)(2-4) 
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(- 3) 


= 101—4-8 + 0-48 + 0-056 + 0-1008 = 96-8368 = 96-84 thousands. 


4.3-5. Central Difference Formulae 


U? U(U? -1 
(a) f(x) = Yor Ups yo + a & yor a LS yo 
u2(U*-1 2(u? -1)....(U? -(r-1)" 
C3 ei ns 
4! 2r!} 
U(U? -12)....(U? =r? : 
oe y82"*! yo + .... (Stirling’s formula) 
1 > 1 
(d) f@) = By + Vy +5) (12-2 )n8? 1 
2 2 : 2 
pe (v2 2) 6%1 + re ( >) (v>-2) 
3! 4)". 2r! 4 4 
12 
y2? Orb istry. 4 (v2-2\(17-2).. 
4 +" Gre)! 4 4 
2 (2r-1)" | cart 
{v aa 5" WiC seestaes (Bessel’s formula) 
2 
1 
where V= U-> 
+12 U+rje” 
(c) f(@) = y+ UY By 1 +O 8 y0 +. 3 yo 
2 ’ : 
(2r+1) 
Ga gerthy i Fite (Gauss backward formula) 
r+l1)! sk 
, 2 
It is used when U is negative. 
(2) 
x) = yo UM By) +870 + 
@ fey 148 y0 
5 ! 


(U+r-H2"-) geet, , Ut 
(Qr-D! a 


~—4\(2 
7-1)? sir 


It is used when U is positive. 


ar! Yor cous 


(Gauss forward formula) 


Central difference formulae are used to interpolate near the middle of the table. 


3 ‘ 
Ex. 4-63. Find the value of an annuity at 53% from the following table : 


Rate per cent Annuity-value Rate per cent Annuity-value 
1 
4 17-:29203 55 14-53375 
1 
45 16-28889 6 13-76483 
5 15-37245 


140 : MATHEMATICAL STATISTICS 


mo artcarican stansmics | 


DIFFERENCE OPERATORS AND IN1 
pz tae rece a a a 


Sol. Since 5 3% lies near the middle of the table, any central difference formula can be | where - V = U- 5 
applied. 
‘ yoryi 
Difference Table = 9 
Rate Annuity-value A a AS At 1 r( 
4-0 17-29203 6 
— 1:00314 
4.5 16-28889 0-08670 | re | 
— 0-:91644 —0-00896 + —]) 
5-0 15:37245 0-07774 0-00100 24 
5.5 1453375 ~ 838 0 neg7g ~ 0700796 | V = 0:75-( 
— 076892 | ii 
6-0 13-76483 | ioe 
: | x 
Here | Xo = 5:0; h=0-5 1 
+= 7 
5 ~5 2 { 
= ——=0-75 1 
(i) Using Stirling’s formula. 1 
eon 
_ U2. | U(U*-1)_ 5, U*>(u?-1) 24 
ee at eae ee. ae ee 7 = 14953 
= 14-736: 
2 2 
= yot UpAy | + ue AZ yy ,UU* =) pa? y-32 Ex. 4-64. From the we 
age al 31 Age . 
2(y2— Annual Premium 
+ “= At yo suse (in Rs.) 
5 A U2 u(U? 4) { By_,+A3y_3 ae 
7 yotAy-i| _U", = Y1+4 y-2 : 
= yot yp {mora } + T A’ y+ 31 | 2 | Premium y 
2 772 
SD es 
4! e 
_— . _ . . 2 
= 15-37245 + (0-75) {eens} + Oe (0-07774) 
. (0-75) {(0-75)* -}} {=2-00706=0- 00896] 
3! 2 
0-75)" : 
+ (0-75)" {(0-75)" -}} (0-00100) 
4! By Stirling’s formula, we 
= 14-736589 = 14-73659. 30-75 
(ii) Using Bessel’s formula. | nee 
i ee | eee seem | 2 
Ye = Bin + vy. + Py (v -;) u& yin + 3 v(v -3} 8S yin + QO. 


(0. 


2 yfy2_1)\fy2_2) 13 
+ rT (vr i) (v >} no Jint.... + 
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mntral difference formula can be 


A A‘ 


-0:00896 


0-0010 
- 0-00796 ° 


u(u? -1) A’y_| +A y_y 
3! 2 


1644) (0-75)? 
——¢ + ~~ (0-07774) 


2! 


0- mee 


_ 15-37245+14 -53375 


Annual Premium 


yo2rs = 32-75 + (0-25) { 
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; 2 2 
Yorn , Vayy +1? -;) A*yo +A y-1 
2 2! 4 2 


4 4 
+ = (”? a" -*) Ary tA y-2 +... 
24 4 4 2 
0:75 — 0-5 = 0:25 


: + (0-25) (-0:83870) 


: 5 {(0-25)? -0-25)} {M0078 0.0777 


2 


ae ; (0-25) {(0:25)? — 0-25} (-0-00796) 


+ 7 {(0:25)" — 0-25} {(0-25)? — 2:25} (0-00100) 


14:95310 — 0:209675 — 0-0069150 + 0-0000622 + 0-0000171 
14-736589 = 14-73659. 


Ex. 4-64. From the following data, find the annual premium at the age of 33. 


24 28 32 36 40 
28-06 30-19 32-75 35-94 40-00 


By Stirling’s formula, we have 


96)\2 
3:19+2 sh, (0-25) (0-63) 


2 2! 
‘ (0-25) {(0-25)? -}} {2-20-20} 
3! 2 


4 (0-25) (0-25)? -B 


41 (0-04) 
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= 32-75 + 0-719 + 0-02 — 0-009 — 0-000 
= 33-48, 
Ex. 4-65. Show that 
us ‘3 25(c—b)+3(a-c) 
2 256 
where a = yotys, b=y1 + ya, =C=Y2 +s if A’y, are constant. 
Sol. From Bessel’s formula , 


2 2 
yx = Torn 4 raro+s(V?—5) (* Yots 1] 


¥i1 = 


2! 4 2 


Wee eee ge (v1) 
=) Ves Py ue | a= 
+37 s)eraitg AR 


(*- 3) Aty_;+Aty_2 
4 2 


1 2 1 2 *) 
—y{y2-=){ 2-2] as 
* 10 ( a) 4) % % 


Let Xo = 2 
Then Yo = Yo ¥-1=My» ¥-2=Yo Vi = V3 = Ya and Y3 = ys. 
1 1 1 1. 
Also Vo = U-==(x-x)-==|2—-2|-— = 
: 5 = =m) - 5 =(25-2)-5 
Yor 1 ; 
ya = yy = (e-1 yyt (E- 1) 1) 
2- = 2 16 
2 2 
+2 (E-1)' y+ (E-1)* ya} 
768 
Yoty 1 
= agra ge {(% - 2% + Yo) + (M1 — 20 + ¥-1)} 
9 
+ 768 {(¥3 — 42 + 6y1 — 4¥0 + ¥-1) + (2 — 4 + OY0 - 47-1 + ¥-2) 
Y2+¥3 


1 
a a { ya-Y3-Y2 + yi} 


9 
+ 568 {ys — 3yq4 + 2y3 + 2y2 — 3y1 + Yo} 


c 1 3 
= ee fa pale ee 
ie ee 
= 5+ 55g Bla-e) + 2510-0), 


Ex. 4-66. If third differences are constant, show that 
x+ = 


1 1 ; 
Yaad Ty et ver 75 Ayer tai yy, 


- Sol. From Bessel’s formula 


ee: 
% = tot + Voyo+ (v?-3} [toss foxe 13] 


DIFFERENCE OPERATORS AND INTE 
Ea ee 


Let Xo = x 

Then Yo = Vx Vi.= yx 

1 
v(t) 

1 
and V= U-5 = 
Yx t¥x+ 

"ett Fa 2 


Ex. 4-67. The following tabl 
of workers in a big manufacturin 
Earnings per month 
up to Rs. 10 
up to Rs. 20 
up to Rs. 30 
Find out the number of worl 


Sol. 


5)= 25-30 
U(for x = 25)= “Ao 
5)= 35-30 

U(for x = 35)= ae 
For x = 25 since U is nega 
y-o5 = 300 +1 

pict 

= 300- 


For x = 35, since U is pos: 
have 


yos = 300 + 


_@ 


MATHEMATICAL STATISTICS 


10 


“° ‘yx are constant. 


Yota?y-1 
2 


)x 


"Yo + ¥-1)} 


(2 - 4% + 6yo — 4y4 + Y-2) 


+Ay_, 
2 
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1 1 
+5, vv? 2a , 
Let X =x 
Then Yo = Vo YF Yx+1. Yi =Ye-1, ele. 
eae 
2 2 
ae eee 
and V= U- , =0 
Yx t+ ¥x41 1 
on = ng Wt Mx 


Ex. 4-67. The following table relates to income earned per month by a certain number 
of workers in a big manufacturing concern. 


Earnings per month Freq. Earnings per month Freq. 
up to Rs. 10 50 up to Rs. 40 500 
up to Rs. 20 150 up to Rs. 50 Me 700 
up to Rs. 30 300 up to Rs. 60 800 
Find out the number of workers falling within the Rs. 25-35 earning group. 
Sol. 


for x = 25)= -0-5 
U(for x ) 10 
35-30 
U(for x = 35)= =i 0-5. 
For x = 25 since U is negative, we apply Gauss’s backward formula. By this formula. 
0-5 
y-os = 300+ (— 0-5) (150) + coe?) (50) 


— . . 2 _ _ . . 2 _ . 
gi e0e5) i 5)*-}} (0) 9:5) i I} 1-5} (~ 50) 
= 300-75 — 6-25 — 1:17 = 217-58 = 218. 
For x = 35, since U is positive, we apply Gauss’s forward formula. By this formula we 


have 
| os = 300+ (05) (200) + LES") (59) 
: 
| , 05) CE) (50) + £02D0- a DEUS ¢ soy 
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= 300 + 100 - 6-25 + 3-125 - 1-17 
= 395-705 = 396, 


.. No. of persons earning between Rs. 25 and Rs. 35 = 396 — 218 = 178. 
Ex. 4-68. [fp, q, 1, s, be the successive entries corresponding to equidistant arguments 
in a table, show that when third differences are taken into account, the entry corres-ponding 


1 
to the argument half way between the arguments of q andr is A + 34 B, where A is the A.M. 
of q and r and B is the A.M. of 3q - 2p —s and 3r- 2s - p. 
Sol. In Ex. 4-66, Ict 
Yet = Py x= Qs Yor =r and yr =s 
qtr 1 
Thea yg Sag E-  E= Dye 


X+— 


en ae { Yrt2 — Vert —Yx + Yx-1} 
1 
= — —-r-gt 
A 6 {s—r-q+p} 


1 
Also B= 5 {(3q - 2p —s) + 3r—2s-p)} 


3 
= 5 @tr-p-s) 


1 
= +— x 
Mein Ae age 


To find Missing Terms 

Ex. 4-69. Given Up + Ug = 1:9243, U, + U7 = 1-9590, Uz + Us = 1-9823 and U; + Us 
= 1-9956. Find the U4. 

Sol. Since eight entries are given, from these values a polynomial of degree seven can 
be obtained and hence A’U, is assumed to be constant and consequently A°U, = 0 for all x. 

fe A*Uy = 0 

ie, (E-1)®Uj)= 0 

i.e, (E*—8E’ + 28° ~ 56E° + 70E* — 56E° + 28E” — 8E + 1) Up =0 

Le, (Ug + Up) - 8(U7 + U1) + 28 (U6 + U2) — 56 (Us + U3) + 70U, = 0 

a 70U4 = -1-9243 + 8 (1-9590) — 28(1-9823) + 56(1-9956) 


= 69-9969 
69-9969 
Us = ~ ae 0-999956 = 1-0000. 
Ex. 4-70. Find the missing terms in the following data : 
x: 2:0 2-1 2-2 2:3 2-4 2:5 2:6 
y: 13-5 ?. 11-1 10 ? 8-2 7-4 


Sol. Taking the missing entries as x and y the difference table is given below : 


x- 13-5 
21 x 24-6 — 2x 

11-1-x 3x — 36:8 
2:2 11-1 x — 12:2 y—4x+40.1 

-11 y-xt33 ‘5x - Sy - 16.3 
2:3 10 y~-89 x—4y + 23-8 


DIFFERENCE OPERATORS AND INT 


x y A 
y-10 
2-4 
8:2-y 
2°5 8:2 
— 0-8 
2:6 7:4 
Taking Ab y=0 
5x — Sy- 
-x+10 
A x= 12:3 
Ex. 4-71. Find the value of 
x ; 2 
yo: 1 
Sol. 
x y A 
2 1 
4 
3 $ 
8 
4 13 
x-13 
5 x 
61-x 
6 61 
64 
7 125 


Assuming A° y= 0, x = 28 
Ex. 4-72. The following t 
children born per mother. Find ; 


years. 

Ageofmothers : 15 
No. of children _ : ( 
Sol. 


Age of mother in 
years 


15-19 
20 — 24 
25 ~ 29 
30 — 34 
35 ~39 
40-44 — 


A°’yo = 0 or ( 
ys — Sy4 + LOy3 — 10) 
5-8 — 5(5-7) + 10y3 - 

or y3 = 479. 
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x y A Ng Ae x A’ 
= 396 - 218 = 178. : yan eee pane 
nding to equidistant arguments a id 8-2 paws 3y ~27-2 ss 
‘count, the entry corres-ponding 2-5 8-2 “a y-9 - 
sAt+ 34 B, where A is the A.M. — 08 
2-6 7-4 
Taking A’ y=0 


S5x-Sy-163 =0 
-x+ 10y—78-1=0 


xt} eee x=12:3 and y =9-04. 

Ex. 4-71. Find the value of y for x = 5 from the set of values 

x4 2 3 4 6 7 

yo: 1 5 13 61 125 

ee ee 
x y A A? Ae A‘ AS 
1 ; 
4 
3 5 4 
8 x-25 
4 13 x-21 120 ~ 4x 
x-13 95 — 3x 10x ~ 286 
U2 + Us = 1-9823 and U; + Us : 61-x ee 3x-71 is 
. 6 61 x+3 

olynomial of degree seven can 64° 
onsequently A®U, = 0 for all x. 7 125 

Assuming A° y =0, x = 28-6. 
8E+1)U=0 — Ex. 4-72. The following table gives the age of mother and the average number of 
- Us) + 70U, =0 children born per mother. Find the average number of children born per mother age 30-34 
+ 56(1-9956) Bde , 

Ageofmothers : 15-19 20-24 25-29 30-34 35-39 40-44 
| No. of children : 0-7 21 3-5 ? 5-7 5-8 
| Sol. 

Age of mother in 
4 2-5 2-6 years 
? 8-2 7-4 


‘table is given below : 


\4 Ae 


I 


A’yy) = 0 or (E-1)°yo=0. | 

'~ 4x + 40.1 “2 Ys—Sy4 + 10y3 — 10y2 + 5y1 — yo = 0 
5x~S5y-16.3 _ J. 5:8 — 5(5-7) + 10y3— 10(3-5) + 5(2:1)— 0-7 = 0. 
or : 3 = 4-79. 


‘— 4y + 23-8 
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Ex: 4-73. Interpolate the missing figure in the following table of rice cultivation : 


Acres (in Millions) Acres (in Millions) 


As two missing terms are to be determined, two equations are needed. We take them to 
_be 


A’yo = 0 (1) 
and A’y,; = 0 (2) 
Eqs. (1) and (2) give 
y7— Tyg + 21ys — 35y4 + 35y3 — 21y2 + 71 ~ yo = =( 
and ye ~ 7y7 + Aye — 35y5 + 35y4 - 2193 + Ty2-y1 = 0 
: 21lys—21y. = — 162-7 
or 35ys—7y. = 1642-1 


ie ys = 60:58 = 60-6 
and y2 = 68:3 
Ex. 4-74. Estimate the production for the years 1985 and 1985 with the help of the 
following table : 
Year : 1970 1975 1980 1985 ~ 1990 1995 2000 
Production in 
000,000 units: 200 220 260 ? 350 at . 430 
Sol. 


Wetake Avy = 0 
and A’ y, = 0 
ie, ys — Sy4 + 10y3 - 10y2 + Sy) — yo = 0 
and Yo — Sys + 10y4 — 10y3 + 5y2-y, = 0 


sie yst 10y3 = °3450 
and Sys + 10y3 = 5010 


DIFFERENCE OPERATORS AND IN’ 


= 390 
Ex. 4-75. interpolate U2 fi 
x 
U; 


_and explain why the value obta 


expression 2° + 5. 
Sol. 


We take At yo = 0 
ie, y4—4y3 + 6y2-4y1- 
: y= 9-5, 


1. If (i) f(x) =x""! show tha 
f Xo; ies Xn) = Xo 


(ii) f(x) = y show that 


f (0, Xi] veceee Xn) = 


(iii) Show that nth order 
and higher order dix 

2. Show that 
A" (ax" + bx") =n 

3. Prove that 
U, - User t U,+2° 


se lv ‘ oe iat 
oo 
4. Show that 
Us, — "cy 2! Usn—1 
where ~U, 


and the interval of 


aes Values of 


4 3x2) + x) _ 
2x? — 3x + 3x- 10 
6. Repreen the followin 
(1) x’ ~ 12x? + 42x? — 
(2) x¢ —3x° + 2x + 6. 
7, Find a cubic function c 
respectively. 
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ing table of rice cultivation : 


Acres (in Millions) 


ys are needed. We take them to 


1) 
(2) 


oo 


and 1985 with the help of the 
1990 1995 2000 


350 ==? _. 430 
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= 390 and y; = 306. 
Ex. 4-75. Inerpolate U, Ee the following table : 
x : 2 3 4 5 
U; : ; ? 13 21 37 


_ and explain why the alae obtained is different from that obtained by putting x = 2 in the 


expression 2* + 5. 
Sol. 


We take A‘ yo = 0 
ie, y4a—4y3 + 6y2 —4y1 + Yo = 
a yu 9:5. 


EXERCISES 


1. If () Ax) =x"*' show that 
So, a site Xn) =Xot Xt ee tXq 


(ii) f@)= at show that 


SF (X05 X15 seve Xn) = 


(iii) Show that nth order divided differences of a nth degree polynomial are constant 
and higher order divided differences are zero. 
2. Show that 
A" (ax" + bx™')=n!a (h=1) 
3. Prove that 
U, — Une + Ogea t+ Upson ti. 


2 GEER gee Usd 
21 x 8 ars 2! 8 x= 
4. Show that : 
Uoy — "C1 2 Ury— 1 + "C2 2? Ugg ove + (-1)" 2" Uy = (-1)" {e -2 an} 
where ~U, = axt+bxte 


a and the interval of differencing is unity. 
F 


. Find Values of 
pe 3x9 +2 _7 at x=5. [Ans. 298] 


2x? — 3x? + 3x-10 at x=5. [Ans. 180] 
6. Represent the following polynomials in the factorial notations. ° 
(1) x — 12x + 42x” - 30x +9 [Ans. x — 6x© + 13x@) + x + 9] 
(2) x4-3x° +2 +6. [Ans. x + 6x + 4x + 6] 
7. Find a cubic function of x which has the values 1, — 3, — 1, 13. when x = 1, 2, 3, ‘a 
' respectively. [Ans. 5 — 2x — 3x" +x°] 
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8. Sum the series 


10. 


11. 


12 


13. 


14, 


15. 


16. 


(2) 17.27+27.37+37.4? +.... upto # terms. [ Ans. 7 (3n4 +15? +25n? +15n +2] 


. Apply Lagrange’s formula to find f(5) and (6) given that 


x : 1 2 3 4 7 
Sx): 2 4 8 16 128 
and explain why the result differ from those obtained by completing the series of 
powers of 2. {Ans. 32-9 and 66-7] 
If P(x) = ao + a; cos 2x + a2 sin 2x agrees with f(x) when x = Xo, x; x2 then prove that 
sin(x —x,)sin(x—x3) 


: Xo) + two similar terms. 
sin(xg —x;)sin(x9 -x2) F (20) 


P(x) = 


Use Newton’s formula for interpolation to find the annual premium at the age of 33 
from the table given below : 

Age 24 28 32 36 40 

Annual premium 

(in Rs.) 28:06 30:19 32:75 35:94 40-00 


{Ans, 33-48] 
From the following table estimate, by using Newton’s formula, the premium payable 
at the age of 22 years : 


Age (in yrs.) 20 42 30 «35 40 = 8 45 
Premium (in Rs.) 25 28 32 37 43:5 52:25 
[Ans. 26-05] 


Use Newton’s formula to find the annual premium payable at the age of 26 years 
from the following table giving the annual premiums charged by an insurance company 
for a policy of Rs. 1,000: 


Age nex birthday : 20 25 30 35 40 
Annual premium (in Rs.) : 23 26 30 35 42 
{Ans. 26-73] 

Using Newton’s formula for interpolation estimate the population for the year 1965 : 

Year Population 

1951 98,754 

1961 132,285 

1971 168,076 

1981 195,690 

1991 246,050 


[Ans. 1,47,841] 
From the following information find the number of students who obtained less than 
45 Marks : 
Marks 30-40 40-50 50-60 60-70 70-80 
Frequency 31 42 51 35 31 [Ans. 48] 
Determine the number of workers earning Rs. 124 or more but less than Rs. 125°from 
the following data : 


* Earnings No. of workers Earnings No. of workers 
less than Rs. less than Rs. 
120 ; 276 135 918 
125 599 ; 140 966 


130 804 [Ans. 54] 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 


am 


26. 


DIFFERENCE OPERATORS AND INTER 
pee tn ee ee 


From the following table es 
Rs. 60 and Rs. 70. 


Wage No. « 

(in Rs.) (in t. 
Below 40 
. 40 — 60 
60 — 80 


The following table relates t 
in a big manufacturing conc 
Earnings No. 
per day 
up to Rs. 10 
up to Rs. 20 
up to Rs. 30 
Find the number of workers 


Find f (4) from table below 
x 3 1 2 
FQ): 2 4 


Find (0:6538) using the foll 


Xx F(x) 


0-62 06194114 
0-63 0-6270463 


Find log 324 using the follc 
x : 310 
logx: 2-491362 2 


Use Stirling’s formula to ot 
X38 0 0- 
FQ): 0 0-19 


If J, represents the number . 
will permit. /, for x =.35, 4 


Me * ‘20 ‘ 
lee 512 4. 
274) 

Find 12516 , using Gaus: 
Xo 2 12500 
Vx: 111-803399 


Use Stirling’s formula to f 
29 = 49225, 
435 = 45926 and 


Estimate the production o: 
year 1991 1 
In millions 

of bales 17:1 1 
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s. rT} (3n3 +10n? +21n+38] 


(3n4 +15n? +25n? +1542] 


lat 
4 7 
16 128 
. by completing the series of 
[Ans. 32:9 and 66-7] 
MX = Xo, X1 X2 then prove that 


ilar terms. 


ual premium at the age of 33 


40 
4 40-00 
(Ans, 33-48] 
wrmula, the premium payable 
40 45 
43-5  §2-25 
[Ans. 26-05] 


rable at the age of 26 years 
ged by an insurance company 


35 40 
35 42 
[Ans. 26-73] 


dpulation for the year 1965 : 


. (Ans. 1,47,841] 
ents who obtained less than 


) 


[Ans. 48] 
e but less than Rs. 125‘from 
AY No. of workers 
918 
966 
[Ans. 54] 
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17. From the following table estimate the number of persons earning wages between 
Rs. 60 and Rs. 70. 
Wage 
(in Rs.) 
Below 40 
40 — 60 
60 — 80 
The following table relates to income earned per day by a certain number of workers 
in a big manufacturing concern : 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 


26. 


No. of persons 
(in thousands) 
250 
120 


100 


Wage 
(in Rs.) 
80 — 100. 
100 — 120 


No. of persons 
(in thousands) 


70 
60 
[Ans. 


54] 


Earnings No. of workers Earnings No. of workers 
per day per day 
up to Rs. 10 50 upto Rs. 40 500 
up to Rs. 20 150 upto Rs. 50 700 
up to Rs. 30 300 upto Rs. 60 — 800 
Find the number of workers falling within the Rs. 25-35 earning group. [Ans. 178] 
Find f (4) from table below : 
x 4 1 2 3 5 6 7 
FQ): 2 4 8 32 64 128 [Ans. 16-1] 
Find (0-6538) using the following data : 
x f(x) x — £@) x f(x) 
0-62 0-6194114 0-64 0-6345857 0-67 0-6566275 
0-63 0-6270463 0-65 0-6420292 0-68 0-6637820 
0-66 0-6493765 
[Ans. 0:6448325] 
Find log 324 using the following data : 
x : 310 320 330 340 350 
logx:  2-491362 2-505150 2518514 = 2.531479 2:544068 


Use Stirling’s formula to obtain f(1-22) from the following data : 


x: 
FQ): 


will permit. /, for 


0 
0 


0-5 


1-0 


1-5 


2:0 


2:5 


[Ans. 2:-4510545] 


3-0 


0-19146 0-34134 0-45319 0-47725 0-49379 0-49865 
[Ans. 0-38871] 
If], represents the number living at age x ina life table, find, as accurately as the data 


x =.35, 42 and 47. Given: 


12520 
111-892806 


439 = 47 


1995 


? 


[Ans. 395, 326, 


12530 
111-937483 


[Ans. 111-874929] 


236, 


1996 


12-4 


Bae 20 30 40 50 

| 512 439 346 243 

274] 

Find 712516 , using Gauss’s backward formula, from the following data : 
x 12500 12510 

Vx: 111-803399 111-848111 

Use Stirling’s formula to find us, given that 

429 = 49225, u25 = 48316, 

u35 = 45926 and Ugg = 44306 

Estimate the production of cotton in the year 1995 from the data given below : 
year 1991 1992 1993 1994 

In millions 

of bales 17:1 13-0 14-0 9-6 


OW) 


Numerical Differentiation and Integration 


5.1. Numerical Differentiation 

It is the process of finding the derivatives of a function which may not be given in 
explicit mathematical form but for which a certain set of values are given. The procedure is 
to represent the function by an interpolation formula and then to differentiate this formula 
as many times as desired. 

Rules of representing the function by an interpolation formula. 

Argument Values Formula use 


Newton's forward (for differentiating near the 


beginning of the table) 
(near the end) 
(near the middle) 


Equidistant Newton's backward 


Stirling's or Bessel's 
Newton's divided difference 
Non-equidistant or 
Lagrange's 


Ex. 5-1. Find the first and second order derivatives of the function tabulated below at 
the points x = 0, 0-03 and 0-06. 


x : 0 0-01 0-02 0-03 0-04 0-05 0-06 
f@™ : 0 00301  0-:0604 0:0909 0:1216 0:1525 0:1836 
Sol. 
Difference Table 
x f(x) A A? 
0:00 -  0-0000 
0-0301 

0-01 0-0301 ee 0-0002 

0-02 0:0604 6.6308 - 0-0002 

0-03 0-0909 Sonar 0-0002 

0-04 0-1216 © Beas 0-0002 

0-05 0-1525 are 0-0002 

0-06 0-1836 
(i) Atx=0 


Since the differentiation is to be done near the beginning of the table, Newton’s forward 
formula will be used. By the said formula, 


f(x) = £0) + UA (%o) + —— 


arr » 92 £000) 


“Cy May 1626 


NUMERICAL DERERE Eee 7 
a f 


H U=-= 
ere = 0 

d 

f@ == 

= 1 

f'(0) = 1 

= 1 

Also f"(0) = { 


(ii) At x =0-03. 
Since the differentiation is t 
formula can be used. By Stirlin; 


I) = fl 

am i 

+ 

= f\ 

Here U = . 
I’) = i 

= jl 

F(0-03) = 1 

= 3. 

(0-03) = 


(iii) At x =0-06. 
Since differentiation is to be 
will be used. By the said formu 


fx) =f 

x 

Here . U=- 
fx) = 1 


id Integration 


on which may not be given in 
‘ues are given. The procedure is 
1en to differentiate this formula 


on formula. 
Formula use 


(for differentiating near the 
beginning of the table) 
(near the end) 

(near the middle) 


the function tabulated below at 


0-04 0-05 0-06 
0:1216  0:1525  0-1836 


A 


0-0002 
0-0002 
0-0002 
0-0002 
0-0002 


rof the table, Newton’s forward 


) 0? £00) 


Net Or were 


Geet Se een i ae 
Cn May (692-05 Dee. FAQ) 
NUMERICAL DIFFERENTIATION A, ND NTRORATION 


Here U = 0-01 = 100x 
ey = LO) A100 aftso)+ Sto? fero)| 


= 100 {0-0301 + (U- 0:5) och 
Ff’) = 100{0-0301 + (U— 0-5) (0-0002)} y-o 
= 100 {0-0301 — 0-0001} =3 
Also f’'(0) = (oe . at =(100)? {0-0002) = 2 
dU dk 
x=0 
(ii) At x =0-03. 
Since the differentiation is to be done near the middle of the table, any central difference 
formula can be used. By Stirling’s formula, 


wur=0) Ud? f(%0) +. 


2 
F(X) = fo) + UpS/ (a0) + 5-87 Flap) 
1/2 -1/2 : 
= flat v EE oy (apt) 4 © a f09-m 


1 
UU? -1) B7+E2 3h 
eae Ne 


2 
Flas) + (Alaa) + Af o0— My} + Za? flay A) + 


x-0-03 
Here U = 0-01 = 100x -3 


f(x) = 100 3} = 100 {Acoso feo- + UM f(xo -} | 


pee 


= 100 + u(0-0002)| 


f(0-03) = 100 {0-0306 + (0-0002) U}u=o 
= 3-06 
10-03 {= df' dU ot 
£°(0-03) = dU dx 
(ili) At x = 0-06. 
Since differentiation is to be done near the end of the table, Newton’s backward formula 
will be used. By the said formula, 


=(100)” {0-0002} = 2. 
-03 


£0) = F64) + UV flea) + SP 0 fe) + on, 
x= 0-06 _ 
Here ; U= 0-01 = 100x-—6 
f'® = 10 {¥ sy) + 2! 5 Ly? fq) *-- . 


100 {0-0311 + (U + 0-5) (0-0002)] 
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f’ (0-06) = 100 {0-0311 +(U + 0:5) (0-0002)} y=0 | 
= 100 {0-0311 + 0-0001} 
= 3-12 

nes <A a are 7 

f” (0-06) ‘oe a (100)? £0-0002} =2 


x = 0-06. 


d 
Ex. 5-2. Find the value of cosh x = a {sinh x} at x = 1-52 from the following table : 


x sinh x x sinhx 
1:5 2:129279 1-8 2-942174 
1-6 2:375568 "1-9 3-268163 
1-7 2-645632 2-0 3-626860 
Sol. 
Difference Table ‘ 
x f (x) = sinh x A A? A A’ A’ 
1-50 2-129279 
0246289 
1-6 2:375568 0-023775 
0-270064 0-002703 
1:7 2645632 0-026478 0:000266 
0-296542_ 0-002969 0-000026 
1:8 2:942174 0-029447 0.000292 
0-325989 0-003261 
1-9 3-268163 0-032708 
0:358697 
2-0 3-626860 
masa See 
0-1 ia | . 
For x = 1:52,U=0-2. : 
Now trom Newton’s forward formula 
U(U-1) U(U-1)(U-2) 
F(&) = f(%0) + UAF 0) + —F,— VF Go) + J A° F 0) 
U(U-1)(U-2)(U -3) 
Pipe ta At f (x0) 
U(U-1)(U-2)(U-3)(U-4 
4 OO 85 fx) 
(neglecting higher differences) 
UP aU U* -3U? +2U 
= f (0) + UAF (x0) + 5 A? f(%0) + A’ f (Xo) 
ut 6U? 4110" =6U ¢ 
+ At (x0) 
24 
U> -10U* +35U? —50U? +24U 
A i) 
120 
j ; 2U-1 3U? -6U +2 
f'(x) = 10] f(x) += A? f(x)-+=———— f(xo) 
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4U> -15 


su* —4 


+ 


f’ (1°52) = 10[0-24628 


ew G 


10 [0-2462% 
= 2:395473, 
Ex. 5-3. Assuming Newton 


d 
() f' (0) = {oo} - 


x= 


i) fe) = 5 {a4 


where yo =f (xo) etc. 
Sol. By the said formula, 


F(x) = yo + UAyo- 


if(x) _ 1 dif} 
dx 


il 
xsl >] 
b 
— 
So 
+ 


For x= 


1 2 
F'(%0) = {Ayo -3 
To obtain (ii) put x = x, sot 
Petpet Ceca 
a h 
; 1 
fier) = 7 Avot: 


Shifting the origin to x;, 


Zot 
fo) = 7 {ay - 
Ex. 5-4. Assuming Newton 
W fx) = A’yo- A’) 
Sol. By the said formula 
F(%) = Yo+ UAyo: 


x—-Xq 


Now U= h 
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102)} u=0 : ; 
| “ 4U° -18U% +22U-6 A* f(t) 
24 
=2 ; 5U4 —40U? +105U? -100U +24 
02} =n sels a ceca F(xX9) 
120 . 
j Sf’ (1-52) = 10[0-246289 + (— 0-3) (0-023775) - 
1-52 from the following table : 
4 : : 
a ee + (° *)(0-002702)+( -°=2)(0-000266)+ (° 72°) 0- 000025 
sinh x 3 15 
2.942174 = 10[0-246289 —0-0071325 + 0-0004 1446 — 000002536 + 0:00000171} 
3-268163 = 2-395473. 
3-626860 Ex. 5-3. Assuming Newton ’ forward interpolation formula show that 
. df a 1 { ae Serer ee } 
Uh de oh ee Apa Nae ype Nye s. 
© Feo) = {4 5p [Moa 8 ots 4 y0-ZA v0 
x = Xo 
3 At AS 1 1 5 1 
(ii) f’ (xo) -_ Pt {ay ‘oe y-] ree toh 
where yo =f (Xo) etc. 
1703 Sol. By the said formula, 
0-000266 
: U(U-1) U(U-1)(U-2) 
769 000292 "000076 £(®) = Yo* Ubyo+ SZ A yet Ge AP ot 
i261 : : 
G(x) _ 1 dif} 
dx h dU — 
1 2U-1 » 3U*-6U+2 3 
= — Ayo + ——— A’ yg t+ A yp ...- 
| I | YO 2 Yo 6 Yo 
+ ‘ x _ 
For x = % U= “2D =0 
U(U-1)(U-2) |; 1 1 1 
I 8 G0) of - {4 mh Ay 44 A yg tose } 
3! 0 oe Fo) h 341) 2 B4)) 3 Yo 
To obtain (ii) put x = x; so that 
QS 
(a) : h h : 
x 
; “O) = r{a tt py, -L ayy + } 
FOSS ora e vos het 
U* -3U* +2U A? f(x) Shifting the pag to x), 
oe 7 {A Neduae 13 + } 
y S(xo) = h | 2 y-1 6 Yap teeee, 
Ex. 5-4, Assuming Newton’ forward interpolation formula show that 
Wh f"(X0) = Ayo — A? yo t oan 
5 
F (0) Sol. By the said formula 
UU-1) 2, _UU-1)(U-2) 3 
) f(x) = yo+ UAyo + A’ yo + A’ yot 
U+2 43 fG) 2! 3! 


= X—Xo 


Now h 
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154 . MATHEMATICAL STATISTICS 
2 io xo + Uh ; 7 G f’(x0+$] = 1 
re 5 h 
UU-])) » 
F(%o+ AU) = yo+ UAyo + ~7; A’y 1 
U(U-1(U-2 “oh 
j Eyelets AI) 
3! 1 
Now by Taylor’s theorem mG 
h?U? 
F(x + RU) = f(%0) + AUS ’(%o) + Ff” (Xo) F ose . _ 
h 
*, Equating co-efficients of U? in (1) 
: where — yo = f 
LS eae Similarly (ii) can be prove: 
— f” = —— Wyo tee 
2! Fo) 2! 2 Yo* Ex. 5-7. Assuming Stirlin; 
h? f"(x0o) = Ary — A® yo + esos (x) at x = X09. 
Note. Equating the co-efficients of various powers of U in (1) expressions for derivatives Sol. From the said formul 
of various orders at x = xo can be obtained. : 
Ex. 5-5. Assuming Newton’s backward interpolation formula show that - f@ = 
1 1 
(i) hf’ (x0) = Vor = V'yo+ 3 Viyo+ 
Se 1 ie | 
(ii) hf’ (%o) = Vy - 3 VIS vy 
UU +1 UU +1)(U+2 
Sol. f(x) = yot UVyo+ a Jey Ee a Vy +... 
(where the origin is at x,) . =f) 
df(x) 2U+1 5 . 3U74+6U+2 —, 
hf’ («) = = =Vyo+ —— +——————_ V 
hf’ (x) dU amar | V'yo 5 Yo 
ae , 1 ee S 
@) .. hf (xo) = Vyor ig. Woe GV 20% es 
(ii) Putx=x_,so that U=-1 
. ; 1 
hf (x.1) = Vyo = 3 Vv’ Yo - Z Vyo Pi skes 
Shifting the origin to x_, 
hf’'(%o) = Vu-= = Wy = “2 Vey tin. 
Ex. 5-6. Show that By Taylor’s theorem, 
Par er aero peer : 
(ij) f (x +4) = h {Ayo oa 74 A Yo tees } ; f(x) = 
h 1 1 Equating co-efficients of 
(ii) f ha {¥¥o mya Vo tee } v7 
bf (x0) = 
- OF (xo - ;| = E' Df(o) 
2 ; R f (Xo) = 


(1 + A)"? Df (xo) 


: 1 1 1 2 f?) ia 
Now : oe ars {a-5a +38... See Ex. 5-3(i) f? Go) 


Mi 
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(1) 


Xo) Fe 


(1) expressions for derivatives 


“mula show that 


COO Ay io, 
3! 
3U? +6U+2 


Vv? 
6 Yo 


See Ex. 5-3(i) 
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rw 


=a)? [apace | fl) 


Rios As ithe ies 
D gtk te. Neer S. 
Al 2 8 \{ 2° 73 fo) 


1 1 
7 {asa + fe) 


1 1 3 
— <Ayo -— A yo... 
*{ Yo 4 Yo } 


where ~ yo = f (xo) 
Similarly (ii) can be proved. 
Ex. 5-7. Assuming Stirling's formula find the expressions for first six derivatives of f 


(x) at x = Xo. 


Sol. From the said formula 


f(x) 


By Taylor’s theorem, 


f(x) = fl%o+ Uh) =f (00) + Uf (x0) + 


uu? 1) 


2 
yo + UpSyo + or yo + 7 US? yo +... 
2 2 Z 2 
, PW 1)... -(r-1)"} 5” yy 
2r! 
U(U? -1)...(U? =r’) us yy 
(2r+1)! os 
+u {ns Ugh d a yp ea ge } 
Yo HOYo Pe Yo 30" Yo 140 Yo te 
o {s soe eS ee 1 cs 
+ — 48% yp -—8" yy +—8 VQ -——5 Yo t-- 
a \° 20-43" 207 99° 2 S60” 7 
ie a ee Oe | 
Sih yn Se Vat =O Vek 
+ 31 {e Yo 4h Yo 120° B40) 
UP ihe. de 7 28 | . 
= A peo yas 
+ {5% 6 797 249° 70 
5 1 
+ © Susy -Fu8"90 } 
5! 3 
T {P0- 480%} 
4 eae, 1S eB ain Pavey ash 
6! Yo 4 YO 


U*h? 
rT f (xo) +. 


Equating co-efficients of different powers of U 


1 1 1 
Af (xo) = pdyo- . pS? yo + 30 5° yo - —~ pS’ yot .... 


140 


1 1 1 
#£%ao) = Fyo- = S y+ 55 By se 8° yo 


90 560 


“eh 7 
Pf? (x) = w8 yo- ri US yo + — pd’ yot.... 


120 


‘ 
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h* f® (~) = & -2 8 ae = 
f (x0) Yo yor To - f(x) 


hf? (xo) = us y-> Teo Yo «+ 


AS f (xo) 


1 
5 yo— — Sy... 
0 0 0 ; 
4 ae 7 jos fe 
Ex. 5-8. Show that C. sed riche 
®: 2 2ilhele 
—— - See ad OD le D x42 —Yx~2). - oa ke 


ql 


dx 
Sol. From Ex. 5-7 (taking h = 1) . r3 2D ERM. f 
dy “pe 
( 7 } = pdyo - : us? yo (neglecting higher differences) ne eee IFS 
t Ps 
atx=x9 ow Z, 46 


bono, 1. 


1 
= = (EY? +B) (EI?) yy = 
2 Ae (sy\zomare A 


1 ; 
ee EE ESE) ant oy 5 Matherratitien 


1 satis 
= 5 (E-E)y (E+ E'-2)(E-E) 


1 
=F Orty)- adage: 


1 
= 7 -y-) -~S 02- vate O01 -y-1) 


, *, As in Ex. 5-7. 
= 3 O1-y-) -< (v2-y-2) af lan) = 
ae - if’ (Xo) = + 
Shifting the origin to x 
dy 2 1 (2) 2s 
ae: i 3 xt Ix) Fy O42 —Yx-2). ; 4 Ip a) | 
Ex. 5-9. Starting with Bessel’s formula obtain the expressions of first four derivatives h ZO (aye 
of f (x) at x = Xp. Y 
Sol. From the said formula afi) 
_ y2)sy,, UR) nee) = 
FO" Wins 7) 2 2, He vie Ex. 5-10. Using Bessel’. 
dy 
1\ U(U-1) ice ee 
+(u-3] 3! SB yint Mioa ax 
(U+r-1)(U+r-—2)...(U-r) ., Sol. From the said formu 
$$$? psd "yin 
2r! 
Ve = | 
(u-3)U+r-yusr-2)..(U-7 i 
a 2 et 1 yur + 
(2r+1)! oo 
1 Wty 1 
Now pyi2 + (u-3] yin = 2 o + Udy — > (1 - Yo) =H 


“= yo + Ubyin 
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Vx-2). > 


ther differences) 


aoa 4 


P yo ; 


2)(E-E") yp 


(E-E")} yo 
l 

5 (v1 — y-1) 
-2). 


sions of first four derivatives 


; ys Vir 


‘) x 
a us’ Vir 


(U-r) 
a Yint.. 


"1 Yo) 


‘ 
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Cxtednight 


or, 


Bessel + 


Sx) = yor 


U-1) 
rT us’ yin 


xu 
USy12 + 


1\ UWU-1 
+ (v-3) : S yin 


ease 1% 


Ae) ~ Yor 


3! 


, Utr-Utr-2)....U- 
La Tanne 


(2r+1)! 


D229 133 
Sia = 18" yp pt Bs + 
u{ 1/2 5 HOT Y1/2 D Yi/2 


us 


.. As in Ex. 5-7. 


“ol 1 1 
nf (Xo) = dyi2- 2 us’ yin + D 8 yn + D 84 yin 


Wf (x0) = 
hf (xo) = 


WF” (xo) = 


1 
Syn = 2 ys? Yin 


1 
w8*y4/2 = 2 S yin 


4 1 
(ua V2 ars vi] to. 


1 ~ | 
us yin 7 > &° Virn- D ys* V+ 


Ex. 5-10. Using Bessel’s formula show that 


dy, 


Sol. From the said formula, 


yx = 


el 
‘| 


1 
+] U-—] by). + 
Lyin ( *) Vis2t oy 


= Yi+Vo +(u-3} Ayo + 


a 
=Ay ,-—A ‘i 
de ged tye 


24 x-= 
2 


U(U-1) 


1) UU-1) 
U- | 31 S yin to... 


2 


1\)U(U-1l) , 
Us= 
| ane 


2! 


Yah vice 


(v- 5 U+r-9Utr-2)...(U-1) 


ae 
24 


ys? yi2 


r) . 
pd” y 


gti 


&° Y1/2 


2 


1/2 


120 


UU=1) A’yo +A°y-1 


1 34 
— ud 
2 HO V1/2 
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yi12 ee 


1 1 1 
 eoltsied: + 8? yy —= 8/2 -—— wd 44/2 +55 sah 
aul (4c Cte 7 {i Yu2~ yO V2 ris Y1/2 4° 2 


Sage na} 
— 46 -— pd are oe 
+ 3! Yi/2 7H Yi/2 

4 
4! 
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Here U = x-Xo (. h=)) 


1 1 
*. Change x tox + 3° Then U changes to U + > 


1 1 
U+— || U-— 
( Al 5] Myth yi 
2! 


Mty 
Yeti = Liat + UAyg + 5 
u(u+3)(u-3] 
3 
i 3! aoe 
dy , ae 
_ 2 3U* —-— 
Fay  2U Mot Aly ro 
dx mo 2 3! at 
Put x =Xq so that U=0 
dy | 
xt— 
=A ee 
dx iYo 4 Y-1 7 wseee 


x=XO0 


Shifting the origin from Xo to x — ; 


ayy 


5.2. Numerical Integration 

It is the process of computing the value of a definite integral from a set of numerical 
values of the integrand. When the function to be integrated is of single variable the process 
is called Mechanical Quadrature. The procedure is to represent the integrand by an 
interpolation formula and then to integrate this formula between the desired limits. 


Quadrature Formulae. 
(1) Trapezoidal rule. 
xg +nh 


+ : 
ydx =h {2 ne +( FQ Heeeee +d} 


: x0 
(2) Simpson’s one-third rule. 
xg tnh h 
! yds = 3 [Oot Yn) + A(y, + ¥3 + oe + Yn) + 202 +4 t + Yn-2)] 
x0 
(Can be applied only when ‘n’ is even) 
(3) Simpson’s three-eighth rule. 
xo tnh 3 
ydx = 3 h (Got yn) + 3401 + ¥2) + Wat Ys) + + On-2 + Yn-t)} 
x0 
+203 + yo +. + Yn-3)I 
(Can be applied only when ‘n’ is a multiple of ‘3’) 


1 ae iene 
(2.0 oe |F 
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cy 


& 
(4) .Weddle’s rule. 


xg tnh 
f ydx = 
x0 
(Can be applied only wh« 
Ex. 5-11. Derive genere 


from it. 
(1) The Trapezoidal rule 


(2) Simpson’s one-third 

(3) Simpson ’s three-eigh 
(4) Weddle ’s rule. 

Sol. Let y =f (x) be the : 


[T= 


Divide the range ‘a’ to ‘ 

xo = 

Let the values of y at the 
The method is to repres¢ 
integrate this formula betwe 
formula of forward differenc 


ff) = 


where U = 
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gral from a set of numerical 
of single variable the process 
»resent the integrand by an 
2en the desired limits. 


2 


Mn-1) + 202 + 4 +. + Yn-2)] 


4 +5) vee + Vn-2 + Yet} 


4 PVT huetad ee eee Cae | 
(20 a IF lo ease wea 1) 
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92 Mite ae tH¥ Eg 2 eA ; bys geet F EZ... 
(4) Weddle’s rule. pe [let é2nteeer 
xQ +nh 


3h 
J yax = To [Ovo + Sy + y2 + 6y3 + ya + Sys + Yo) 
x0 


+ (ye + Sy7 + ys + 6y9 + Yio + S11 + Y12) + - 
+ (Yn-6 + S¥n—-5 + Yn-4 F 6yn-3 + Yn-2+ IVn-1 + yn)] 
(Can be applied only when ‘n’ is a multiple of *6’. 
Ex. 5-11. Derive general quadrature formula for equidistant ordinates and deduce 
from it. ; 
(1) The Trapezoidal rule. 
(2) Simpson ’s one-third rule. 
(3) Simpson ’s three-eighth rule. 
(4) Weddle’s rule. 
Sol. Let y = f(x) be the function and it is required to evaluate 


b 
T= J fad. 


Divide the range ‘a’ to ‘b’ into n-equal parts and let the points of division be 
Xo = a,x; =Xoth,...... Xn =Xo t nh=b. 
Let the values of y at these points of division be yo, 1 «-.. Yn 
The method is to represent the integrand f(x) by an interpolation formula and then to 
integrate this formula between the desired limits. Thus representing f (x) by Newton’s 
formula of forward differences. ‘ 


U(U-1 U(U-1)(U-2 
UG) 25, WUD Ayo + .... 


F(x) = yo + UAyo + 7; 0 31 
X—-Xo 
where U= ar 
b xo tnh 
t= ff@da= J fed 
a x0 


Uu(U-1 U(U-1)(U-2 
UD py, LUDU-D 2p 


n 
h { {yo +UAyo cc 3! 
0 ! 


- LEDURMU-D 94, 
_ LUDU-MU-INU=A 485, 
pcan Ce eC cee) shyy+..4 45 


2 3 2 2 4 3 

n n n° \A yo [n 3 .2|N yo 
= h\ ny, +—Ayo t| —-—— | ot} | a te | 
G Bore (= | 2! (S 31 


5 4 6 
+ ng Bay Avo 5 Lay a2 
2 4 4 
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5 7 c,,6 4 
- oo 3 +12n?) A’¥o ¥0 + ae AO +17n? eee 
3 5! 7 4 
274-4 ”) Ayo 
a 600 | ets icsisee 
trae O a Nir (A) 


This formula is general quadrature formula. 
(1) Trapezoidal rule. 
Putting n = 1 in (A) 


*] 
1 
i= J fiaddr=nfy +z ayo}, 
x0 
Second and higher differences have been neglected as, since the interval of integration 
extends from xo to x; = x9 + A, there are only two values and with these there can be no 
differences higher than the one. 


MN-Yol_h 
h 4-H b = + 
{yo 5 } 5 (vo + 1) 


h 


#2 
h 
Similarly, h = J fde=> (y+ yp) 
x 
1 


I = J flayax= + On-1 + Yn) 


*n-1 
xo +nh x] x2 : Xn 
r= J f@de= | fddes f foddes i+ J fea 
x0 x0 x} *n-1 
= ftht..dlhs 


h 
mar) {Yo + 201 + ya F wee F Yn) + Yn} 


=h {Oos—)¢, FQ Heese +h. 


(2) Simpson’s one-third rule. 
Putting 1 = 2 in (A) and neglecting third and higher differences. 


x2 
1 
Le J foyde=h {2¥0+2Ay +340} 
x0 


1 
=h {20 +2( -y9)+5(E-1)? yo} 
1 
ah {29 +201 -¥0)+402-20 +yo)} 


h 
nae {yo + 4y1 + ya} 


h 
3 {y2 + 4y3 + ya} 


x4 
Similarly h= J fd = 
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xn 
Tin a J t( 
xn-2 
Xn 
l= J 63 
x0 
h 
= 5 loo 
h 
= 5 100 


(3) Simpson’s three-eig 
Putting n = 3 in (A) and: 


x3 
= J ro 


x0 


=h {3y 


| =h {3 


Similarly, 


he = J | 
*n-3 
(assur 
r= J f( 
x0 
3 
= 3 AD 


; +2y 
(4) Weddle’s rule. 
Putting 7 = 6 in (A) anc 


x0 
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6 4 
17H 


(A) 


the interval of integration 
vith these there can be no 


” a f fea 


Xp] 


NUMERICAL DIFFERENTIATION AND INTEGRATION 161 


xn 
h 
Inn = J Sf (x)dx = 3 {yn2+4¥n1+¥n} (assuming n to be even) 


Xn-2 
xn 


I = J f@) d=ht+ht...+tin 
x0 


h 
3 [(vo + 491 + y2) + (v2 + 493 + ya) + oe + On-2 + 4Vn-1 + Yn)] 


Hl 


| 
Ww | o> 
=r 
= 
+ 
= 
+ 
aS 
¢ 
+ 
et 
we 
+ 
+ 
‘S 
= 
I 
qa 
+ 
Ne 
F 
+ 
bd 
> 
+ 
+ 
St 
= 
& 


(3) Simpson’s three-eighth rule.. 
Putting n = 3 in (A) and neglecting all differences above the third. 


i aoe {3y9+#2ay0 +2.8%y0 +580} 
x0 


9 9 3 
=h {390+5.01-90)+ (EN)? 90 +2(E-1) yo} 
2 4 8 
9 9 12 3,3 2 
=h 3¥0 +51 -%0)+ 7 E ~2E +1)y9 + FE —3E +3E—1)yo 


‘ | 
. = ghty0 +3 +392 +93} 
Similarly, 


XG : 
3 
b= J f@)de= = h Os + By + 3ys + ¥6) 
x3 


xn 
3 
Ins = J f8)dE= 5 h es + 3yn-2 + Bn + Yn) 
Xn-3 


(assuming 7 to be a multiple of 3) 


*n 
[= J f@)de=Nht ht oda 
*0 


+203 + Yo +... + Yn-3)] 
(4) Weddle’s rule. 
Putting 7 = 6 in (A) and neglecting differences above sixth. 


fi 


I 


*6 
Lae Io 9 +18Ayp +27A"y9 +24A°y9 i At yo 
*0 


h [6yo +18(E-1) 9 +20 E-1)” yo +24(E-1)° ¥0 
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123 s. 33 5 3 ‘ Te se 
+— (E- +—(E-1 +—(E-1 -——hA 
190 (£-1) % rr "Yo 10° ) Yo i404 Yo 
1 6 
149 "4% 
Choosing ‘h’ s.t. the sixth differences are small, the last term can be neglected. 


3h 
aor {yg + Sy, +2 + O93 +4 +5y5 +6} - 


3h. 
Then ie 10 {Yo + 5y1 + yo + Oy3 + ya t Sys + yo} 
Similarly, 
x12 
3h 
doi J tea (V6 + Sy7 + yg + Oyo + Yio + S¥11 + Y12) 
%6 


Xn 
3h 
In = J f@ar=T {nt Syns t+ Yn-4 + Vn—3 + ¥n—2 + 5Vn—1 + Yn} 


xn-6 


als 


(assuming n to be a multiple of 6) 


x0 


3h 
eT [yo + Syy + yz + Gy3 + y4 + Sys + ye) + (Ve + Sy7 + ys + Gyo + Yio 


+ Sypp + Y12) + oo + Vn-6 + Sy ns + Yn—4 + OVn—3 + Yn—2+ SVn-1 + Yn) 

Note. Since in ‘Trapezoidal Rule’ second and higher difference are neglected, y is 
assumed to be linear. Similarly, in ‘Simpson one-third rule’, ‘Simpson’s three-eighth rule’ and 
“Weddle’s rule’ y is assumed to be polynomials of degree second, third and sixth respectively. 


13 
Ex. 5-12. Evaluate I= f vx dx 
1.0 
by (1) Simpson’ rule. 
(2) Trapezoidal rule. 
Sol. Take h = 0-1 


2 1-0 1-1 1-2 1:3 
ya vx: 1-000000 1-048809 1-095445 1-140175 
(1) By Simpson’s three-eighth rule 
3 
I=3 Al(yo +3) +3{M1 +23] 


3 : 
3 (0-1) [21401175 + 6-432762] 


0-321485 
(2) By Trapezoidal rule. 


+ 
l= h [2052 4¢y +2) 


= (0-1) [1-0700875 + 2-144254] 
= 0-32143415 ~ 0321434. 
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Ex: 5-13. Evaluate I = 


by (1) Trapezoidal rul 
(2) Simpsons rule. 


tT 
Sol. Take A = = 


20 
x y =cos 
0 1-0000( 
= 0.9876: 
20 
as 0-9510. 
20 
eu 0-8910: 
20 
a 0-8090 
20 
ols 0-7071 
20 
(1) By Trapezoidal rule, 
‘2 4 [20° 
mt 
= — [ll 
20 [ 
= 0-9983 
(2) By Simpson’s one-tl 
| r= 2% 
3 ( 
1 
= — fl 
20 
= 1-0004 


Ex. 5-14. Evaluate I= 


by (1): Weddle’s rule. 
(2) Simpson’s one 
(3) Simpson’ thr 
Sol. Divide the range c 


x f@)= 
0-0 1-000: 
0-5 0-800! 


_ 10 0-500) 


MATHEMATICAL STATISTICS 


rm can be neglected. 


t 


10 + Syu + ia) 


6¥n-3 + Yn—-2 + SYn-1 + Yn} 


+ (¥6 + Sy7 + yg + 6¥9 + Yio 


+ 6Yn-3 + Yn—-2 + SYn-1 + Yn) 
fference are neglected, y is 
ipson’s three-eighth rule’ and 


, third and sixth respectively. 
1:2 1:3 
95445 1-140175 
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n/2 
Ex. 5-13. Evaluate I = J cos x dx 
0 


by (1) Trapezoidal rule. 
(2) Simpson’ rule. 


T 
Sol. Take h = = 


20 
x y=cosx x y=cosx 
. 6n 
0 1-000000 30 0-587785 
Tt 70 
30 0-987688 30 | 0-453990 
mals 0-951057 ah 0-309017 
20 : 20 
a 0-891007 on 0-156434 
20 20 
4n 10m 
30 0-809017 70 0-000000 
5x 
am 0-707107 


20 


(1) By Trapezoidal rule, 


k +Y10 
2 


l=h +O 492 tt | 


= oa [10-500000 + 5:853102] 


0-9983446 = 0-998345 
(2) By Simpson’s one-third rule. 


I 


h 
3 [vo + Yio) + 401 + ¥3 +. + Yo) + 2(ya tq t+... + ys)] 


at [1-000000 + 12:784904 + 5-313752] 
1-000406. 


dx 
Ex. 5-14, Evaluate I= i 
9 it +x? 
.. by (1) Weddle’ rule. 
(2) Simpson’s one-third rule. 
(3) Simpson’ three-eighths rule. 
Sol. Divide the range of integration into twelve equal parts by taking h = 0- 5 


1 
x f@=,8 x fO=7,32 
0-0 1-00000 3-5 0-0754717 
0-5 0-800000 4-0 0-0588235 
1-0 0-500000 4:5 -0-0470588 


(Contd.) 


MATHEMATICAL STATISTICS 
EE OES 


—_— ors > see 


1 4 
x PO x I@= Tp 
a ee ee ne 
1:5 0:307692 5-0 0-0384615 
2-0 0-200000 5-5 0-0320000 
2:5 0-137931 6-0 0-0270270 
3-0 0-100000 


x 


200 
300 
400 
500 
600 


(1) By Weddle’s rule. 


I 


3 
10 (0-5) [vo + Sy) + y2 + 6y3 + yg + Sys + Vo] 


+ (v6 + Sy7 + yg + 69 + Yio + SY + y12)] 
(0-15) [8-335807 + 1-0440233] 


1-406974545 = 1-407 


(2) By Simpson’s one-third rule. 


I 


I 


logio x 


2-3010 
2-4771 
2:6021 
2-6990 
2°7782 


0-5 
> [1-0270270 + 5-6006140 + 1-7945700] 


1-4037018 = 1-404. 
(3) By Simpson’s three-eighth rule. 


3 
8 (05) [Wo + Y12) + 301 + y2) + (Va + ys) + 7 +98) + O10 td] 


+ 2(y3 + yo + yo)] 


3 
8 (0-5) [1-0270270 + 5-5280631 + 0-9095016] 


1-39961 = 1-400. 
Ex. 5-15. Compute by Simpsons rule the value of the integral 


1000 dx 


200 logig x 


taking eight subintervals. 
Sol. Here h = 100 


y = (logio x) | 
0:434594 
0-403698 
0-384305 
0-:370508 
0-359945 


By Simpson’s one-third rule, 


logiox 


2-8451 
2:9031 
2-9542 
3-0000 


y =(logio x)" 
0-3514815 
0-344459 
0-338501 
0-333335 


100 
ae [(o + ys) + 401 + y3 + Ys +7) + 202 + 4 + Vo)] 


100 
“z (0-767927 + 5-856754 + 2-177418] 


293-4033 = 293-4, 


ila ems 


NUMERICAL DIFFERENTIATION A? 


Ex. 5-16. Compute log, 2 
evaluate 


ee 
0 1+x 
Sol. Divide the range of int 
eat 
x f@=T 
0-0 1:000000 
0:5 0666667 
1-0 0-500000 
1-5 0-400000 
By Weddle’s rule, 
3 
dx 
T= f Te = 
9 l+x 
+6 (0-400 
toe 
Also . J = ) rx = 
2log.2 = 1:386785' 
Be log. 2 = 0-693392% 
Ex. 5-17. Applying ‘Simpso 
pa 
l+x 


0 
correct to three decimal places. 
Sol. Divide the range of inti 


1 
x fO=— 
0-0 1-000000 
0-1 0-909091 
0-2 0-833333 
0-3 0:769231 
0-4 0-714286 
0-5 0:666667 
= 
dx 
I =] iy. =- 
9 it+x 
0-1 
= —— {1-50 
3 I 
= 0:69315 
0-693 


Ex. 5-18. Evaluate [ = J 
07 


MATHEMATICAL STATISTICS 
——— ee eee 


fer 
0-0384615 
0-0320000 
0-0270270 


+ ye] 


- 202+ y4 t+... V10)] 


700] 


)+ 07 +8) + O10 + yi1)] 


15016] _ 

zral 

10% y= (logio x) 
151 0-3514815 
)31 0:344459 
342 0:338501 
00 0:333333 


i 


2+ y4+ y6)] 
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— oe eee 


Ex. 5-16. Compute log, 2 using a suitable quadrature formula with 7 ordinates to 
evaluate 


j #. 
0 1+x 
Sol. Divide the range of integration into six parts by taking h = 0-5 
eas peed 
7 FOr ae ‘i IO)? tay 
SO 
0-0 1:000000 2-0 0-333333 
0-5 0-666667 2:5 0-285714 
1-0 0-500000 3-0 0-250000 
1-5 0-400000 ° 
By Weddle’s rule, 
3 ‘ 
dx 3 
I= J 77 = 7, (0:5) {1-000000 + 5 (0-666667) + 0-500000 
9 itx 10 
+ 6 (0-400000)+(0-333333)+5 (0:285714) + (0-250000)} = 1-3867857 
3 de , 
Also - r= f Tay = /log (1 +2)? = log. 4 = 2 log, 2 
9 i+x 
2 loge2 = 1:3867857 
loge 2 = 0-69339285 = 0-693. 
Ex. 5-17. Applying ‘Simpson's one-third rule’ evaluate 
i 
0 1+x 
correct to three decimal places. , 


Sol. Divide the range of integration into 10 equal parts by taking h = 0-1 
eee 


1 1 
x es are x for 
0-0 1-000000 0-6 0-625000 
0-1 0-909091 0-7 0-588235 
0-2 0-833333 0-8 0-555556 
0-3 0-769231 0-9 0-526316 
0-4 0-714286 1-0 0-500000 
0-5 0-666667 


1 
dx =h 
1=J T= zor moor ty: +ys+y7+y9) +2 02+ y+ y6+¥—)} 


0-1 
= 3 {1-500000 + 4(3-459540) + 2(2-728175)} 


= 0:69315 
0-693. 
1-4 
Ex. 5-18. Evaluate I = | (sin x — log x + e*) dx 
0-2 


by (1) Simpsons rule 


(2) Weddle’ rule 


taking h = 0-1 in each case. 
Sol. Let y = sin x — log x + e&* 


x sin x 
0:2 0:198669 
0:3 0:295520 
0-4 0-389418 
0-5 0-479426 
0-6 0:564642 ~ 
0:7 0-644218 
0-8 0-717356 
0-9 : 0-783327 
1-0 0:841471 
1-1 0-891207 
1-2 0-932039 
1:3 "  0-963558 
1:4 ~ 0:985450 


MATHEMATICAL STATISTICS 


x 


log x é 
— 1:609438 1-221403 
— 1-203973 1-349859 
— 0:916291 1-491825 
— 0-693 147 1-648721 
— 0-510826 1-822119 
— 0:356675 2:013753 
— 0-223143 2-225541 
— 0105360 2-459603 
— 0-000000 2:718282 
- 0095310 3-004166 
— 0-182322 3-320117 
— 0:262364 3669297 
— 0:336472 4-055200 


(1) By Simpson’s one-third rule, 


I 


(2) Weddle’s rule. 


3h 
Lo To [vo + y2 + yatye +.¥10 +12) + 2¥6 + S01 + ys +7 + Yi1) + 63 + yo)] 


3(0-1) 
10 


y 


3-029510 
2-849352 
2:797534 
2-821294 
2:897587 
3-014646 
3-166040 
3-348290 
3-559753 
3-800063 
4:069834 
4-370491 
4704178 


h 
3 [(voty12) + 401 t+ ystys +7 t yo +11) + 202 + 4+ ¥6 + V8 + V10)] 


0-1 
> [7:733688 + 80:816544 + 32-981496] = 4-0510576 = 4-05106 


(21-058396 + 6-332080 + 67-913895 + 39-728142] 
= 4-05097539 = 4-05098. 


1 
Ex. 5-19. Show that { y, d= (5y1 + 8y0 —y-1): 
0 


Sol. By Lagrange’s formula 
(x—0)(x-1) 2 
Ye = (qroyei-p? |) (0+N(0-1)? (1+ 0-0)“ 


12 


I 


' 1 
J yx ds > 


{2 


Ex. 5-20. Show that 


1/2 


J vx = 


Jt 
1/2 2 


1 


3 


1 
2 


por 


1 


3 


1 
D (5y; + 8yo0 — y-1)- 


(x+1)(x-1) 


1 1 
5 PD yO -Dyot 5 +N 


-1} +i(F43] 
YO 23°32 Bal 


1 
12 +2} + 57 AAy-si2 -Ayi/2} 


(approximately) 


(x+1)(x-0) 


(approximately) 


NUMERICAL DIFFERENTIATION 


Sol. By Lagrange’s formy 


+ o[x°4+ 

1/2 1 

Be dx ~ — 347 
= 5 Oy 
= 5 ow 


1. Find the first derivative « 


Xe 1-00 
f(x): 0841471 0+ 
2. Find the values of f’(10 
x: 10. 
f(x): 3-162278 3. 


3. Find the value of cos 1-7 


x sin x 

1:70 0:99166 
1-72 0:98888 
1-74 0-98571 
1:76 0-98215 
1:78 0-97819 
1-80 0-97384 


MATHEMATICAL STATISTICS 


é y 
121403 3-029510 
149859 2:849352 
191825 2°797534 
348721 2:821294 
322119 2:897587 
13753 3-014646 
125541 3-166040 
159603 3-348290 
118282 3-559753 
104166 3-800063 
320117 4:069834 
369297 4:370491 
)55200 4:704178 


+yy1) + 202 +4 t+ Yet Yat Yio)] 


81496] = 4:0510576 = 4.05106 


501 + ys + ¥7 + y11) + 603 + y9)] 


913895 + 39-728142] 


Yo + 
+x) 


+5) 
) YI 


1/2} 


(approximately) 


(x+1)(x-0) 
(1+1)(1-0) *! 


(approximately) 
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a ee NOY 


ement em 
Cee e3) 323) 
AA” MGI 
ee) 
FEI 


+ 


FIR es OY © OD 
iz = 7 Bg %-3/2 Fg Y-12 F542 74 23/2 


(2 


; {1/2 +Yi2} + ri {(v-12 ~y-3/2)-Osi2 -yi2)h 


1 1 
— fy_yot + — {Ay_4)5 —-A . 
5 12 +12} 4 {Ay_3/2 -Ayi/2} 


Id 


EXERCISES 


1. Find the first derivative of the function tabulated below at the point x = 1-002. 


x: 1-00 1-01 1-02 1-03 1-04 1:05... 
F(x): 0-841471 —0-846832 -0-852108 0-857299 0862404  0-867423 


2. Find the values of f’(10), f’(15) and f’(12) from the following table : 


x: 10 11 12 13 14 15 
J (x): 3:162278 3-316625 3-464102 3-605551 3-741657 3-872983 
[Ans. 0-158109, 0-1291 : 0-144337] 


3. Find the value of cos 1-76 = FE {sin a] using the following table : 


x = 1:76 
x sin x x sin x 
1:70 0-99166481 1-82 0-96910913 
1:72 0:98888977 1-84 0:96398300 
1-74 0:98571918 1-86 0-95847128 
1:76 0-98215432 1-88 0-95257619 
1:78 0-97819661 1-90 0-94630009 


1-80 0-97384763 ar 
[Ans. U-18807675] 
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4. 


12. 


13. 


14. 


15. 


16. 


MATHEMATICAL STATISTICS 


Find the values of £’(0-6) and f”(0-6) from the following table : 
0-4 0-5 0-6 0-7 0-8 

1-7974426 2:3275054  2-6510818 

[Ans. 2-644225; 3-64424] 


xX: 
f(x): 1-5836494 20442376 


. Find the value of f’(1:0), £”(1-0), (1-0), f’(1-10), £”(1-10) and f’”(1-10) from the 


following table : 

x f(x) x f(x) 
1-00. 1-00000 1-20 1-09544 
1-05 1-02470 1-25 1-11803 
1-10 1-04881 1-30 1-14017 
1:15 1-:07238 


ST 


[Ans. 0-50024; — 0-256; 0:4; 0:4767; — 0-216; 0:320] 


. Find the values of f’(1-72), f°(1-7), £15), £(20), £17), £15), and f"(2-0) 


from the following table (For central values use Stirling’s formula). 
x: 1-5 1-6 1:7 1:8 1-9 2:0 
f(x): 0405465 —0-470004 0-530628 0-587787 0-641854 0-693 147 
[Ans. 0-581393; 0-588230; 0-666702; 0-500027; 
— 0:345858; — 0-445391; — 0:248809] 


. Find the value of (0-425), f’(0-65), (0-425) and f”(0-65) from the following 


table : 
x: 0-4 0-5 0:6 0-7 0-8 
So: 0-389418 0-479426 0-564642 0-644218 0-717356 
[Ans. 0-911056; 0-796092; — 0-412735; — 0-604942] 


. Prove (i) of Ex. 5-3 (a) by the method of Ex. 5-4 () by using operators. 
. Prove (i) of Ex. 5-5 (a) by the method of Ex. 5-4 (5) by using operators. 


Find the expressions of f”(xo) and f’”(xo) in terms of backward differences. 


d 
. Find oe at x = 1 from the following table constructing a central difference table : 


d 
x: 1 2 3 4 5 6 
y: 198669 295520 389418 479425 564642 644217 
Show that 


. 1 feo). chee 
and alae (83-35 8+ Bade! } 


10 
dx 
Evaluate / = J rae by dividing the range into eight equal parts. [Ans. 1-299] 
2 


5 
dx 
Evaluate [= J ea by Simpson’s rule. [Ans. 1-62] 


1 
5-2 1 ; 3 
Evaluate J = j P dx by (1) Simpson’s rule (2) Weddle’s rule. 
4 
(Taking h = 0-2). [Ans. 0:262364; 0-262364] 


90° 
Evaluate J logy sin x dx by Simpson’s rule (taking ten subintervals). [Ans. — 0-095] 


30° 


NUMERICAL DIFFERENTIATION Al 


17. Using Simpson’s rule and 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 


x: 0-5 0-6 
y: 0-4804 0-5669 
Evaluate the integrals 


11 Il 
@fod&k Ww fy 
0-5 0-5 

5.2 
Compute / = j log, xd) 
4 


Evaluate J e* dx using S 
0 

x: 0’ 

ée: 1 


0:3 
Evaluate f (1—8x°)! G 
0 


n/2 
Compute f sin x dx by (1 
0 


If U,=a+ bx + cx’, prov 
3. 
J Usdr - 
1 
Calculate by Simpson’s on 
equidistant ordinates. 


Find an approximate value 


x: 1-00 
y: 3-953 
Use Simpson’s rule:to pro 


MATHEMATICAL STATISTICS 
PI kaa cba a 


ing table : 
0:7 0:8 
2:3275054  2-6510818 


[Ans. 2-644225; 3-64424] 
"(1-10) and f’”(1-10) from the 


ff) 
1:09544 
1-11803 
1:14017 


5; 0-4; 0-4767; — 0-216; 0-320] 
£17), f'(1'5), and f"(2-0) 
1g’s formula). , 
1-9 2:0 
0:641854  0-693147 
588230; 0-666702; 0-500027; 
1858; — 0-445391; — 0:248809] 
df”(0-65) from the following 


0:7 0-8 
0644218 0-717356 
3092; — 0-412735; — 0-604942] 
b) by using operators. 
b) by using operators. 
of backward differences. 


g a central difference table : 


5 6 
125 564642 644217 | 
ee } 
it equal parts. [Ans. 1-299] 


[Ans. 1-62] 


iddle’s rule. 
[Ans. 0-262364; 0:262364] 


‘en subintervals). [Ans.— 0-095] _ 
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17. Using Simpson’s rule and the table : 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 


x 0:5 0-6 0-7 0-8 0-9 1:0 1-1 

y: 0-4804 0-5669 0-6490 0-7262 0-7985 0:8658 0-9281 
Evaluate the integrals 

it 11 Il Ll 
@ fod @ fydxe i) f x*ydx (iv) f yP de. 

0-5 0-5 0-5 05 
52 [Ans. 0-3585, 0-3210, 0-3104, 02444] 
Compute / = i log, xdx by (i) Simpson’s rule (ii) Weddle’s rule (Taking h = 0-2). 

4 
[Ans. 1-827847] 


Evaluate J e* dx using Simpson’s rule and the following data: 


0 
0° 1 2 3 4 


x: 
é: 1 2:72 7:39 20-09 54-60 
0:3 
Evaluate { (1-8x*)'* dx using Simpson’s three-eighth rule. [Ans. 0-29159] 
0 
n/2 
Compute f sin x dx by (1) Trapezoidal rule (2) Simpson’s rule. (using 11 ordinates). 
0 
[Ans. 0-9981, 1-0006] 
If U,=at bx + cx’, prove that 
; 1 
f U,dx = 2u2+ DD (uo — 2u2 + ua) 
1 3 
Calculate by Simpson’s one-third rule an approximate value of J x4 dx by taking 7 


equidistant ordinates. -3 
1-04 
Find an approximate value of J ydx given the following values : 
1-00 
x: 1-00 1-01 1-02 1-03 1-04 
y: 3-953 4-066 4-182 4-30( 4-421 


Use Simpson’s rule to prove that log, 7 is approximately 1-95. 


OO 


Curve Fitting and Method of Least Squares 


6.1. Introduction 
Fitting of curves to a set of numerical data is of considerable importance—Theoretical 
as well as practical. It is based on the following principle known as principle of least squares. 


Principle of Least Squares 

It says that the best or most probable value of the sguisured quantity is that value for 
which the sum of the squares of the errors is least. 
6.2. Solving System of Linear Equations 


Consider the system of equations 
44, X + Ay. X2 + ateee + Ain Xn = dy 
21 X1 + A272 XQ + Aon Xn = b2 
Any X1 FT Ame X2 + oe + Amn Xn = Bm 
In the case when m > n, the equations are solved by writing 


m 
2 
= > (b)- aj) — 4X2... — Gin Xn) 
f=] 


and minimizing S. From calculus, the equations determining x,, x2 ....X, So that Sis minimum 
are 


AL. 0, ? 
Oxy Ox 
These equations are called Normal Equations and the values Of X1, X2 4..-., X, obtained 
from these are called best or most plausible values. 
Ex. 6-1. Form the normal equations and hence find the most plausible values of x and 
y from the following. 
x +y=3-01, 2x-y=0-03, x + 3y=7-03, 3x + y= 4-97. 
Sol. Let S=(x+y—3-01)? + (2x—y— 0-03)? + (x + 3y— 7-03) + Bx +y— 4-97) 
Normal equations are 


a. 8s 
ax 8 y 
as 
Now Ox = 0 imply 
(x + y — 3-01) + 4(2x — y - 0-03) + 2(x + 3y — 7-03) + 6(3x + y — 4:97) =0. 
ie., 15x + 5y — 25-01 =0 (1) 
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os 63 
By i 
2(x + y— 3-01) — 2(2 
5x +1 
Solving (1) and (2) 
x = 0¢ 


Ex. 6-2. Find the most pla 
“x t2y+z=1, xt 


Sol. Let S=(¢ 
+ | 
Normal equations are 
rN 
0" 3 
+ 
Le., 22x 
o = 2 
ce) 
+ 
ie, l1x 
2 
oar 
Le., - 19; 
From (1), (2) and (3) 
x=1-17, y=: 


Ex. 6-3. Find the most pl 
x-yt2z=3, 3x 


Sol. Let S=€ 
+ 
Normal equations are 
c 
0O=- 
C 
+ 
1Lé., 27x 
€ 
O=-5 
c 
+ 
Leé., 6x + 
¢ 
Ou. 
¢ 
+ 
ieé., yr 
From (1), (2) and (3) 


x=2-47, y=3 


east Squares 


ile importance—Theoretical 
as principle of least squares. 


ed quantity is that value for 


X2 ....X, So that Sis minimum 


ues of x1, X2 ,...., X, obtained 
ost plausible values of x and 


= 4.97, 
- 7:03) + (3x +y-4-97" 


) + 6(3x + y — 4-97) =0. 
(1) 
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os eaaal 
— = 0im 
oy ply 
2x + y— 3-01) — 2(2x — y — 0-03) + 6(x + 3y — 7-03) + 23x + y— 4-97) = 0. 
5x + 12y— 29:04 =0 (2) 
Solving (1) and (2) ; 


x = 0-9995, y=2-0035. 
Ex. 6-2. Find the most plausible values of x, y and z from the equations given below : 
“xt2ytz=1, wtytz=4, -xty+2z=3 and 4x+2y—5z=-7. 
Sol. Let S = (x+2y+z-1P + (Qx+y+2-4) + (x+y + 22-3) 
+(4x + 2y-5z+7) 
Normal equations are 
os 
0= ae eet eyez I) Tae tye 2e4) eee ye ae) 
+ 8(4x"+ 2y - 5z+7). 
Les 22x + 1ly-19z+22=0 (1) 


0 = . =4(x+ 2y+z—1)+2(2x+yt+z-4)+2-x+y+2z-3) 


+ 4(4x + 2y-5z+7) 
ie, lix + 10y—5z+5=0 (2) 


os 
0= a =2(x+2yt+z—1)+2(Qxt+yt+z—-4)+4(-xt+y+ 2z-3) 


— 10(4x + 2y-5z+7) ...(3) 
Le., — 19x -— 5y + 31z-46=0 © (3) 
From (1), (2) and (3) 
x=1-17, y=-0-75, z=2:08. 


Ex. 6-3. Find the most plausible values of x, y and z from the following equations : 
x-yt2z=3, 3x+2y-Sz=5, 4x+y+4z=21, —x+3y+3z=14. 
Sol. Let S = (x-y+2z-3)9 + Gxt 2y— 52-5)? + (4x+y + 42-21) 
+ (-x + 3y + 3z- 14) 
Normal equations are 


os 
0= oe = 2(x—y+ 2z-3) + 6(3x + 2y — 5z-5) 
+ 8(4x + y + 4z—- 21) - 2(-x + 3y + 3z—- 14) 
i.e, 27x + 6y — 88 =0 AA) 
os 
= Bio ee ee ee) 
+ 2(4x + y+ 4z-21) + 6(-x + 3y + 3z- 14) 
ie., 6x + 15y+z-70=0 (2) 
. as 
0= Be RNs oy 122 9) NCE ey oe 9) 
+ 8(4x + y + 4z—21) + 6(-x + 3y + 3z- 14) 
ie., y+54z-107=0 ..(3) 


From (1), (2) and (3) 
x =2-47, y =3-55, z= 1-92. 
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Ex. 6-4. A man is three times as old as his son. Ten years hence his age will be twice the 
age of his son. Five years before the age of the man was five times that of his son. Find their 
present ages. 


Sol.. Let x and y be the ages of the man and his son. 


Then x = 3), ; 
x+10 = 2(y+10) ie, x—2y-10=0 
‘and x-5 = 5(y-5) ie, x—Sy+20=0 
Let S = (x-3y)? + (x-2y- 10) +(x— 5y + 207 


Normal equations are 


as 
0 =  =2(x- 3y) + 2¢-2y—- 10) + 2(¢—- 5y + 20) 


ox 
ie, 3x-10y+10=0 (1) 
os 
and 0 = Ge A 1) 10 = 520) 
ie., — 10x + 38y — 80 =0 (2) 
From (1) and (2), 


x= 30, y=10. 
6.3. The Method of Least Squares F 

Let y = f(x), be the formula (containing m unknown parameters a}, a2 .... dm), Whose 
form is to be inferred from the results of experiment or observation and in which the unknown 
parameters are to be determined from experimental or observational data (x1, y1), (x2, y2) «.- 
(%ns Yn) (n > m). These sets of simultaneous values of x and y would, when substitiited in the 
formula, give n equations in m unknowns a}, a2, ... dm, and from these equations the best 
values of a}, @3....dm are to be obtained. To solve this problem method of Least Squares is 
used. 

The method of Least Squares says that the best representative formula is that for which 
the sum of the squares of the residuals (i.e., most probable value — measured value) is 
minimum. Since the squares of the residuals are positive, the requirement that their sum 
shall be as small as possible ensures that the numerical values of the residuals will be small. 


_ Let Yn = fi) 
Residual forx =x; =y;-y; 
.~ .. By the method of Least Squares aj, a, .... 7 are to be obtained so that 


n n 
s= > tiny}? =X xi)-y3" 
i=] i=1 
is minimum. From calculus the equations determining a}, a2 .... dm, are 
os os os 


day ay Cam 
These equations are called Normal Equations. 
6.4. Curve Fitting 


(1) Fitting of Parabolic Curves. 
To derive the least square equations for fitting a curve of the type. 
y = ant ayxt ax? +. t dmx” (Qm # 0) 
to a set of n points. 
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Let (x, y) i = 
Y= a 
Let S= : 
i 
By the method of least sq 
oS 
bas, = 0 
Now cs = ; 
0ao i: 


First normal equati 


n 
Sy =n 
i=] 


ae os 
Similarly ia 0 
1 
n 
> XiVi = a 
i=1 
2 
> xi Vi =a 


n 
> xi" Yj > 
i=] 


Eqs. (1) to (m + 1) are rec 
6.4-1. Corollary 


Fitting of a straight line. 
The equation of a straighi 


I 
a 


ya 

.. The normal equations 
yan 

and ixy =a 


The calculations are sim 
instead of y and x. Generally, 


Ex. 6-4-1.1. Show that t 


(Xn» Yn) may be expressed in tl 


x oy 
2X; = 
ix? £ 


Ex. 6-4-1.2. Using the pr 

(x, yi = 1, 2, .... n with a vie 
Simplify the equations wi 
0,+ 1,42, 0.0... +h 


MATHEMATICAL STATISTICS 


nce his age will be twice the 
es that of his son. Find their 


ie, x—2y-10=0 
ie, x—-Sy+20=0 


+ 20) 
2(x — Sy + 20) 

AL) 
- 10 (@-5y +20) 

(2) 


leters a1, Az .... Ay), Whose 
nand in which the unknown 
onal data (x1, 1), (*2, y2) ... 
ild, when substitiited in the 
m these equatioris the best 
nethod of Least Squares is 


'e formula is that for which 
alue — measured value) is 


‘equirement that their sum 
the residuals will be small. 


e obtained so that 


Am are 


? type. 
(Gm # 0) 
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a 


Let (x, vi) i = 1, 2, .....1 be the given data and 
Y= ao + ax; + anx? + veegas + An Xj 


Let S 


n n 
a (W-y)? = > {ao + ayx; + 0... + Angxi” — yj}? 
j= i= 


By the method of least squares S is to be minimized. Normal equations are 


as. as sas 
ag day Oa, 
as “ 
Now —=) 2 {a9 + ayxj+ a2 x7 +... + dg xf" — yi} 
Cag jz} 
.First normal equation is 
n n n n 
DY = nagt ay Y xytag Y. xP + ccm Spee (1) 
i=l i=l i=l i=l 
os 
Similarl = = 0.... —— =0umpl 
yey, Bay, py 
. 2 m+ 
D = a YE +0, YD xP +... +0, ya, (2) 
i=l i=l i=l i=l 
a) a eee m+2 
DY = a > xP +a DOP totam Dx} (3) 
i=l i=l i=] i=l 
i ~ a m+l 2 
m 
PY = ag Dt xP tty YX (m+ 1) 


Eqs. (1) to (m + 1) are required equations. 
6.4-1. Corollary 

Fitting of a straight line. 

The equation of a straight line is 

Y= agtayx 
.. The normal equations are 
Ry = nag + ayXx 

and Xxy = ao Lxta; Xx’ 

The calculations are simplified by taking the variable v and u (to be chosen suitably) 
instead of y and x. Generally, u is chosen s.t. Zu=0. 

Ex. 6-4-1.1. Show that the best-fitting linear function for the points (x1, y1); (x2, 2); 


(Xn, Yn) may be expressed in the form 


x y 1 
2X; Ly; n = 0. 
Ex? xy Ly; 


Ex, 6-4-1.2. Using the principle of least squares, fit a straight line to the pairs of values 
(x; vi =1, 2, .... n with a view to determine y for given x. 
Simplify the equations when the possible values of x are 
A as ee tk and n=2k +1 
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Ex. 6-5. Fit a straight line trend by the method of least squares to the following data : 


Year Milk consumption (Million litres) 
1990 102-3 
1991 . 101-9 
1992 ——-:105-8 
1993 112-0 
1994 114-8 
1995 118-7 
1996 124-5 
1997 102-9 


Sol. Let x and y be the variables for years and Milk consumption. 


Let the straight line to be fitted be 


v = at+bu 
where the co-efficients ‘a’ and ‘b’ are to be determined from the normal equations. 
Xv = na+ b<u 


Luv = adu+biw? 
Substituting the values of Zv, Zu etc. 
—13-1 = 8a-4b 
85-9 = (-4a)+ 446 
b = 1:89, a=-0-69 
Eq. of the ss line is 


= —0-69+ 1:89 u 
ie, y—112 = —0-69 + 1-89 (x— 1994). 
Ex. 6-6. Fit a straight line trend by the method of least squares to the following series. 
Year Price Index 
1991 107. 
1992 110 
1993 114 
1994 112 
1995 115 
1996 . 13 


hs ees 
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Sol. Let x and y be the v 


v= 
where the co-efficients ‘a’ an 
Lv =) 
Luv = « 
Substituting the values o 
-1l= 
and 22 = 
Multiplying (2) by ‘2’ ai 
43 =. 
or b= 
*, From (1) 
6a = 
or a= 
*, The equation is 
y= 
or y-112 7 
Ex. 6-7. Compute the si 
least squares; determine the 


Year 


1999 
2000 
2001 
2002 
2003 


Sol. Let x and y be the 
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ares to the following data : 
Million litres) 


u=x-— 1994 


v=y-112 


e normal equations. 


ares to the following series. 


dex 


pan fy 
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Sol. Let x and y be the variables for years and Price Index. 


Let the equation of the straight line to be fitted be 
v = a+bu 
where the co-efficients ‘a’ and ‘b’ are to be determined from normal equations 
Ly = nat brxu 
Luv = aku t bru’ 
Substituting the values of Zu, Lv etc. 


-1 = 6a-3b .(1) 
and 22 = (-—3)a+19b (2) 
Multiplying (2) by ‘2’ and adding to (1) we get 
43 = 35(b) 
or b = 1:23 
. From(1) 
6a = 2-69 
or a = 0-45 


*, The equation is 
vy = 0-45 + 1-23u 
or y-112 = - 0-45 + 1-23 (x — 1994). 
Ex. 6-7. Sie the straight line trend equation for the data below by the method of 
least squares; determine the annual trend estimates of each year. 


Year Sales in thousands of Rs. 
1999 25 
2000 30 
2001 40 
2002 50 
2003 45 


Sol. Let x and y be the variables for years and sales. 
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Let the equation of the straight line be 
v = a+bu 
where the co-efficient ‘a’ and ‘b’ are to be determined from the normal equations 
Lv = nat bru 
Luv = aXu+ b&w? 
Substituting the values of Zu etc. 
-—10 = Sa or a=-2 
and -60 = 105 or b=6 
.. The equation is 


v = -2+6u 
(v-—40) = —2+6(x-2001) 
Annual Trend estimates for each year are shown in the table. 
Ex. 6-8. Show that the line of fit to the following data is given by y = 0-7x + 11-28 : 
x 0 5 10 15 20 25 
y: 12 15 17 22 24 30 


Let the equation of the line be 


v = atbu 
The normal equations are 
Lv = na+ bru 
and Luv = alu+ brIw 
Substituting the values of Zw etc. 
18 = 6a-—3b (1) 
52 = —3a+19b ...(2) 
b = 3-486 
a = 4-743 


ws The equation to the straight line is 
v = 4-743 + 3-486u 
x-15 


4-743 + 3-486 (4) 


or y-17 
0-7x + 11-28. 


or y 
6.4-2. Corollary 

Fitting of a second degree parabola. 

The equation of a second degree parabola is 

y= Ap + a4 X + ax" 
*, The normal equations are 
Zy = nagot+a,Xx+ andx" 
“xy = apixt aie + ax? 
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and Ley = « 
Ex. 6-9, Fit a parabolic 
independent variable : 


xs 0 
yi: 1 
Find out the difference b 
the fitted curve when x = 2. 
Sol. 


The normal equations ar 
Xv =i 

Zvu = 1 

Zw? = « 

Substituting the values o 
04 =: 

113 = ° 

or b=) 
85 = - 

ne c= 
.. The equation of the s: 
ve=- 


". Value of y for x =2 ¢ 


Also, actual value of y fc 
’. Difference = 1-3- 
Ex. 6-10. Fit a paraboli 

5 oe 1-0 
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normal equations 


ven by y = 0:7x + 11-28: 
25 


30 


(1) 
(2) 


a 
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a EEE 


and Xx’y = aux’ + axe a an=x', : 
Ex. 6-9. Fit a parabolic curve of second degree to the following data taking x as the 
independent variable : 
Kes 0 1 2 3 4 
ae 1 1:8 1:3 2:5 6:3 
Find out the difference between the actual value of y and the value of y obtained from 
the fitted curve when x = 2. 


Sol. 


Let the equation of the parabola be 
v = atbutc’ 
The normal equations are 
Lv = na+byYut civ 


Lvu = alut b3u* + ctw 
Zw? = aXu’ + bdu> + cru! 
Substituting the values of Zu, Lv etc. 


0-4 = Sa+10c (1) 
11-3 = 105 

or b = 1-13 ...(2) 
8-5 = 10a+34c 


ae c = 0-55, a=- 1-02 
.. The equation of the second degree parabola fitted to the given data is 
v = ~ 102+ 1-134 + 0-55u? 
y = 1-48 + 1-13 (x—2) + 0-55(x- 2) 
*, Value of y for x = 2 obtained from the fitted curve 
= 1-48 
Also, actual value of y for x = 2 is 1-3. 
*, Difference = 1-3 - 1-48 =-0-18. 
Ex. 6-10. Fit a parabolic curve of Regression of y on x to pairs of values : 
5 a 1-0 1-5 2:0 2°5 3-0 3-5 40 
y: 1-1 1:3 1-6 2:0 2-7 3-4 4-1 


2 7 8 16 14 
3 41 14 27 81 42 
| ff of f-27 [28 | oo | 196 | 43 | ~57 | 
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Let the curve to be fitted be 
v = atbu+cu 

The normal equations are : 
Lv = nat bdu+ clu’ 


Du = aku+ bru? + cx? 
Luv = aku? + bYu? + cdu* 
Substituting the values of Su, Zv etc. 


2 


-27 = 7a+28c (1) 
143 = 28b 
_ 143 
or b= a 5-107 
~57 = 28a+ 196c (2) 
Multiplying (1) by ‘4’ and subtracting from (2) 
84c = 51 
or c = 0-607 
Multiplying (1) by ‘7’ and subtracting from (2) 
21a = -132 
or a = — 6286. 


.”, The equation of the second degree parabola fitted to the given data is 
v = (— 6-286) + (5-107)u + (0-607)u? 


x-2°5 
+ (0-607 
0.5 } (0-607) ( 
y = 10354 - 0-1926x + (0-2428)x”, 
Ex. 6-11. Fita second degree parabola to the following data, taking x as the independent 
variable : 


x25) 


y-2-7 : 
or (27) = (-6:286) + (5-107) ( 0-5 


0-1 


9 


fof Tt 2 fo | of m8 | si | ~6 | 


—y 


Let the parabola of second degree to be fitted be 
vy = atbutc 
The normal equations for this are : 
Ly. = nat bdu+ cL 
Iw = akut bw + cl 
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Dw? = 
Substituting the values « 
2= 
51 = 
or b 
-69 = 
i.e., -23 = 
Multiplying (1) by ‘20’ 
924c = 
or c= 


*. From (1), a= 


Thus the equation of th« 
Vv = 
Substituting the values « 


y-8) = 


y 
(2) Fitting of Curves « 
Taking logarithm 
logigy = 
or Y 
(where y= 
The constant A and b ca 
From A, a is obtained on tak 
(3) Fitting of curves o. 


fl 


Taking logarithm © 
logioy = 
or Y= 
(where y= 


The constants A, B and 
Ex. 6-12. The populati. 
Year : 193 
Population in 
Millions : 3% 


By fitting a curve of the 
Sol. 
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given data is 


607) ( 


x-2: 


0-5 


sy 


AL) 


(2) 


taking x as the independent 


4 
6 
2 
0 
0 
3 
6 
6 
4 
1 


ss saree arenes 
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Dow? = aku? + Yue + cdut 
Substituting the values of Xu, Xv etc., in these equations : 


2 = 9a+60c (1) 
51 = 605 
or b = 0-85 
-69 = 60a + 708c 
i.e., —23 = 20a + 236c (2) 
Multiplying (1) by ‘20’ and (2) by ‘9’ and subtracting 
924c = —247 
or c = —0-267 


: 18-02 
. From (1), a= ar as = 2-002 


Thus the equation of the parabola of second degree is 
= 2-002 + (0:85)u — (0-267)uv" 
Substituting the values of u and v 
(y—8) = 2-002 + (0-85) (x — 5) — (0-267) (x - 5)’, 
= (— 0-923) + (3-52)x — (0:267)x? 
(2) Fitting of Curves of the type y = ax? 


Taking logarithm 
logioy = logyoa + b logio x 
or = A+bxX 
(where y = logioy, A=logioa, and X= logiox) 


The constant A and b can be cre as in (6.4-1) by using y and_X instead of y and x. 
From A, a is obtained on taking antilog. ; 
(3) Fitting of curves of the type y = ab* 


Taking logarithm © 
logioy = logioa + x logio b 
or y = At+xB 
(where Y = logioy, A=logioa, B=logiox) 


The constants A, B and hence a, b are obtained as in (2). 
Ex. 6-12. The population of a state at ten yearly intervals is given below : 
Year: 1931 1941 1951 1961 1971 1981 1991 2001 
Population in 
Millions : 3-9 5:3 73 96 12:9 17-1 23-2 30:5 


By fitting a curve of the from y = ab’ to this data estimate the population for 2011. 
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The equation to the curve to be fitted is 
y = ab* 
Taking logio 
logiey = logio a+ x logiob 
or y = A+xB 
where y = logioy, A=logio a, B= logo b 
Replacing x by u 
y = A+Bu 
‘The normal equations are 
Ly = nA+ Blu 


“uy = Alu+ Bx? 
Substituting the values of Zy etc. 


8:3544 = 84-4B (i) 
and * +» 11707 = (-4)4+44B ...ii) 
Multiplying (ii) by ‘2’ and adding to (7) 
10-6958 = 84B 
or B = 0:12733 
‘, From (i) 
84 = 8-86372 
A = 1:107965 
’. The equation is 
y = 1:107965 + 0-12733u 
x-1971 
= 1-107965 + 0-12733 (=) 
. For x = 2011, 
yY = 1-107965 + 0-50932 


i) 


1-617285 
1-6173 
41-43 


“ Population for in year 2011 = 41-43 millions. 
Ex. 6-13. (a) Derive the least-square equations for fitting a curve of the type 


b 
= ax’ +— to a set of n points. 
y a P 


(b) Fit the curve y= ax’ + . to the data given below : 


fe 1 2 3 4 
y: -1-51 0-99 3-88 7:66 
Sol. (a) Let (x;y), 7= 1, 2, ... m be the given data and 
b 
xj 
n 2 n 2 b 2 
Let S= >» (yi = > ax; + Yi 
i=l i=l a! 


Normal equations are 


—— 
ie > yixi = ad xp +b dx _() 
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ra} 

and 0= 3 
n 

i.e., ies =a 


(1) and (2) ar a ied e 
(b) 


Substituting values in eqs 


159-9300 = 3 

and 2:1933 = 1 
From (3) and (4) 

a=0 


*. The equation of the ct 
y = 0 


Ex. 6-14. Derive the lea: 
set of n points. 
Sol. Let (x;, y)i=1, 2 


Y=6 


Let S =) 


Normal equations are 


as _ 
Oa i 
n 
ie : >; 
i=l 
n 
Lé., » Xi¥i = 1 
i=l 
d as ( 
an <= 
ob 
ie, De ze, 


(1) and (2) de cer 
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.(Z) 
ii) 


w fitting a curve of the type 


Ade 
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and 


o 
Ul 
, 218 
UW 
Ms 
N 
a 
8 
he 
+ 
= | 
t 
x 
ee 
| 
ae eae) 


n 
Le, ies 

i=l *i i=l i=l 
(1) and (2) are required equations. 
(5) 


(2) 


Il 
gQ 
M 
Ral 
+ 
o 
Ms 
to 
nN — 
ad 


1-4236 | 159-9300 2-1933 


Substituting values in eqs. (1) and (2) 


159-9300 = 354a+ 10b ; (3) 
and 2-1933 = 10a+ 1-4236b (4) 
From (3) and (4) 
a = 0-509, b = -2-04 
*, The equation of the curve best fitted to the given data is 
= 0-509 x? - ona 
x 


b 
Ex. 6-14. Derive the least-square equations for fitting a curve of type y = ax + — x” a 
set of n points. 
Sol. Let (x;, yi = 1, 2.... 2 be the given data and 


b 
Y= ax; + — 
xj 
no. 2 n b 2 
Let s= > %-y) => [axi+b-s] 
j=] j=] [ 
Normal equations are ; ; 
os “ b 
— = 24 ax;+——); ; 
da 2 By feo 
n 
Le. > (ax? +B xii) = 0 
i=1 
n 
1é., > ivi = nb+a ¥, xj (1) 
i=l i=l 
and ab 0 2d i x; i Xx; 
a4, ae | 
ie, 2 tt L natb pees (2) 
Xj, i=l % 


=1 
(1) and (2) are required equations. 
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Ex. 6-15.- Three independent measurements on each of three angles A, B, C ofa triangle 
are as follows : 


A: 39-5 39-3 39-6 
B: 60:3 62-2 60-1 
Cx 80-1 80-3 80-4 


Obtain the best estimates of the three angles, taking into account the relation that sum 


of the angles is equal to 180°. 
Sol. Let the measurements for angles A, B and C be denoted by x, y and z respectively. 


Then Lx = 39-54 39-3 + 39-6 = 118-4 
Ly = 60-3 + 62-2 + 60-1 = 182-6 
and Xz = 80-1 + 80-3 + 80-4 = 240-8 
Let a, B and y be the true values of angles A, B and C respectively. 
Then y = 180-a-B. 


E{(a-x)+(B-y) + (y-2)} 
E {(a—x) + (B-y’) + (180-0 - B-2)} 
By the principle of least squares, S is to be minimized. Normal equations are 


Let S 


ie = ¥[2(a— x) — 2(180 -— a B -2)] 


oa 
ie, 22a+=XB = Lx-=z+ 2180. 7 
ie, 6a+ 3B = 118-4—240-8 + 540 = 417-6. 
ie, 2a+B = 139-2 (1) 
os 
and » 0 = ap 7 [2B -») - 2(180- 0 -B-2)] 
which'implies. 
a+2B = 160-6 
From (1) and (2) 
a = 39-27, B= 60-67 
y = 180-a-f = 80-06. ..(2) 


EXERCISES 


1. Find the most plausible values of x and y from the following equations : 
x+y=3, x-y=2,x+2y-4=0, x= 2y+1. [Ans. x= 2:5, y=0-7] 
2. A and B are two brothers A is ten years older than B. Five years before Ass age was 
_ twice that of Bs. Five years hence twice the age of A will be same as three times that 
of B. Find their present ages. [Ans. 25, 15] 
3. Fit a straight line to the data given below : 
xs 6 7 7 8. 8 8 9 9 10 
y: 5 5 4 5 4 3 4 3 3 
[Ans. y=— 0-5x + 8] 
4. Fit a straight line to the following data treating ‘y’ as the dependent variable. 


xe 1 2 3 4 5 
: 5 7 9 10 11 [Ans. y = 3-9 + 1-5x] 
5, Fit a straight line to the data given below; showing the production of a commodity in 
different years : 
Year x: 1991 1992 1993 1994 1995 
Production, : 10 12 8 10 14 
(1000 tons) 


[Ans. y = 0-6 (x — 1993) + 10-8] 


t 
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6. Fit a straight line to the fo 


~ 


10. 


11. 


12 


. 


13. 


14, 


15. 


16. 


x: 1 
y: 1 


. The weights of a calf tak 


using the method of least 
Age: 1 2 
Weight : 52:5 587 


. Find the equation of the 


following points : 
x: 0-5 
y: . 031 0 


Below are given figures o 
Year : 1991 
Production : 80 
(in thousand 
mounds) 
Fit a straight line trend by 


‘Fit a straight line to the fo 


x: 1 
y: 2-4 
Fit a straight line to the fo 
x: 1 
y: 1-4 
The profits, Rs. 100y, of a 
x: — 
: 25 
Fit the second degree para 
=3)'| 
Fit a parabola of second de 
x: 1 
y: 1:8 : 
The profits Rs. y of a certi 
x: 1 
y: 1250s. 
Show that the parabolic re 
y = 1140+ 72x 
Fit a second degree parabi 
x: 0 ( 
y 3-1950 3. 
x: 0-6 ( 
y 3-1807 3-: 


Fit a second degree parab: 
x: 0 
y: 1 
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:angles A, B, Cofa triangle 


39-6 
60-1 
80-4 
‘count the relation that sum 


d by x, y and z respectively. 


ectively. 
-B-2)"} 
rmal equations are 
~2)] 
.(1) 
~2)] 
.(2) 


ving equations : 
[Ans. x= 2-5, y= 0-7] 
ve years before As age was 
| be same as three times that 
[Ans. 25, 15] 


8 9 9 10 

3 4 3 3 

[Ans. y =— 0-5x + 8] 
dependent variable. 


[Ans. y = 3-9.+ 1-5x] 
‘oduction of a commodity in 


b =: 1995 
14 


s. y = 0-6 (x — 1993) + 10-8] 
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6. 


10 


11 


. 


12. 


13. 


14. 


15. 


16. 


Fit a straight line to the following data regarding x as the independent variable : 
Xx: 1 1 2 3 4 


y: 1 18 3:3 45 63  [Ans.y=0-72+ 1-33x] 


. The weights of a calf taken at weekly intervals are given below. Fit a straight line 


using the method of least squares and calculate the average rate of growth per week. 

Age: 1 2 3 4 5 6 7 8 9 10 
Weight : 52:5. 58-7 65:0 70:2 75-4 81-1 87:2 95:5 102-2 108-4 
[Ans. y = 79-62 + 6-16 (x — 5-5); 6-16] 


. Find the equation of the straight line which comes nearest to passing through the 


following points : 
x? 0-5 1-0 1-5 2:0 2:5 3-0 
y: . 031 0-82 1-29 185 2-51 3-02 
[Ans. y =— 0-285 + 1-10x} 


- Below are given figures of production of a sugar factory. 


Year: 1991 1992 1993 1994 1995 1996 1997 1998 
Production : 80 90 92 83 94 99 92 100 
(in thousand 

mounds) 
Fit a straight line trend by the method of least squares to the above data. 
[Ans. y = 94 + 3(x— 1995)] 


‘Fit a straight line to the following data : 


x: 1 2 3 4 5 8 
y: 2-4 3 3-6 4 5 6 
Fit a straight line to the following data : 
x: 1 2 3 4 6 8 
y: 1-4 3 3-6 4 5 6 
The profits, Rs. 100y, of a certain company in the xth year of its life are given by 
x: 1 2 3 4 5 
y: 25 28 33 39 46 
Fit the second degree parabola ofyonx. —[Ans. y= 32-914 + 5-3 (x~3) + 0-643(x 
~3y) 
Fit a parabola of second degree to the following data taking x as independent variable: 
bs 1 2 3 4 5 , 
y: 1-8 5-1 9-0 14 19 
The profits Rs. y of a certain company in the xth year of its existence are given by: 
x: 1 2 3 4 5 
y: 1250 1400 1650 1950 2300 


Show that the parabolic regression of y on x is 
= 1140 + 72x + 32-15x? 
Fit a second degree parabola to the data given below : 


x: 0 0-1 0-2 0-3 0-4 0-5 
y 3-1950 3-2299 3-2532 3-2611 3-2516 3-2282 
x 0-6 0-7 0-8 0-9 
y 3-1807 3-1266 3-0594 2-9759 


[Ans. y = 3-1951 + 0-4425x — 0-7653x"] 
Fit a second degree parabola to the following data : 
x: 0 1 b) 3 4 
y: 1 5 0 22 38 
[Ans. y = 5-914 + 9-1 (x — 2) + 3-643 (x— 2)7] 
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[Ans. y=4-642 e°4™] 


[Ans. y = 100(1-2)"] 


5 6 7 -8 

49-1 65:6 87-8 117-6 
[Ans. y = 11-58 ene] 

125 15-0 

16 11 


[Ans. y = (113-4) e° 144] 


2:5 3-0 
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17. Fit the curve y = ae™ to the data given below : 
xs 0 2 4 
y: 5-012 10 31-62 

(e = 2-71828) 

18. Fit the curve y = ab* to the data given below : 
x: 2 3 4 #5 
y: 144 1728 207-4 248-8 

19. Fit the curve y = ae™ to the following data : 
x: 1 2 3 4 
y: 15:3 20:5 27-4 36°6 

20. Fit y = ae™ to the following data : 
x: 25 5:0 75 10-0 

ys 7% 52 35 25 

21. Fit the curve of the type xy” = b to the following data : 
x: 0:5 1-0 1-5 2:0 
y: 1-62 1:00 0-75 0-62 


22. Use the method of least squares to determine ‘a 


for the following data : 
x: 1 2 3 4 
y: 1:8 5-1 8-9 14-1 
Calculate the value of y for x = 2. 
23. Fit y =a + bx’ to the following data : 
x: 5 7 9 11 


2 290 560 1044 1810 


0-52 0-46 


’ and ‘b’ in the formula y = ax + bx? 


5 
19-8 


[Ans. a = 1-521, b= 0-49, 5-006] 


12 
2300 : 
[Ans. y = 130-71 + 1-2572x"] 
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[Ans. y= 4-642 e7**] 


[Ans. y = 100(1-2)"] 


; 6 T38 
\1 65-6 = 87-8_—«117-6 
[Ans. y= 11-58 &°7"*] 


15 15:0 

6 11 : . 
[Ans. y = (113-4) 0.154 4] 

5 3-0 

52 0:46 


‘h’ in the formula y = ax + bx’ 


5 
9-8 
is. a= 1:521, b=0-49, 5-006} 


12 
300 ; 
[Ans, y = 130-71 + 1-2572x'] 
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Probability 


_JACAntroduetion 


A fundamental principle involved in the experiments of science and engineering is that 
if these experiments are repeatedly performed under very nearly identical conditions the 
results are essentially the same. But, there are experiments in which results will not be 
essentially the same even though conditions may be nearly identical. Such experiments will ~ 
be called random experiments or simply experiments. 

Below are certain terms which will be used subsequently. 

Trial. Performing of an experiment is called trial. _. 

Cases. Various possible outcomes of a trial are termed as cases. 

Event. /t is used to represent the aim with which the experiment is performed. 

Sample space. /t is the set of all possible outcomes of an experiment. 

Event is a subset of sample space and cases are its members i.e., subsets consisting of 
single members. 

The class of all events associated with a given experiment is defined to be the event 
space. , 

(E.L.) Equally Likely Cases (Events). Cases (Events) are called equally likely when 
none of them can be preferred rather than the other. 

(M.E.) Mutually Exclusive Cases (Events). Cases (Events) are called mutually 
exclusive when no two of them can occur simultaneously. . 

Exhaustive Cases (Events). A set of cases (events) is said to be exhaustive if it includes 
all possible outcomes of a trial. 

Favourable Cases. The cases which entail the happening of an event are said to be 
favourable to an event. , 

In a trial, there is always uncertainty as to whether a particular event will occur or not. 
To measure this uncertainty the idea of probability (or chance) was introduced. It is the 
number between 0 and 1. If the event is sure to occur its probability is taken to be 1 and if the 


: ian fee 2 
event is sure not to occur its probability is taken to be zero. If the probability is a it means 


that there are 25% chances for the: event to occur and 75% chances for the event not to occur. 
Below are two definitions of probability of an event. 


7.1.1. Mathematical Definition 
Let ‘n’ be the number of cases which are equally likely, mutually exclusive and exhaustive 
and ‘m’ of these are favourable to the happening of an event ‘A’. Then the probability of the 


happening of A is defined to be - 
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7.1.2. Statistical Definition 

If a trial is repeated a number of times under essentially the same conditions, then 
limiting value of the ratio of the number of times the event happens to the number of trials, 
as the number of trials increases indefinitely, is called the probability of the happening of 
that event. (It is assumed that the ratio approaches a finite and unique limit). 

Both these definitions have serious difficulties, the first because the word “equally likely” 
is vague and second because of the vagueness of infinite number of trials. Because of these 
difficulties, the following axiomatic approach to probability was introduced. 


7.1.3. Axioms of Probability 


Let Q be a sample space. Let cx be the class of events in Q. To each A in of is 
associated a real number P(A) st. 


(1) P(A) 20, VA €ce7 
(2) P(S)=1 
(3) If A), Ag)... are any number of mutually exclusive events in c°/ then 
P(A, + Az +...) = P(A,) + P(A2)+... 
Obviously, P is a real valued function defined on cP. is called a probability function 
and P (A) the probability of the event A and 0 < P(A)<1. 


The triplet (Q, c°%, P[-]) is called a probability space. 
Remarks. Starting with definition (7.1.1) axion (3) of (7.1.3) can be proved. In view of 


O(A) 
this P(A) is taken to be OQ) » where O(A) denotes the order of A. 


Odds in favour of and against an event are defined as below: 


Prob. of happening 


Odds in favour of an event = Probof hon happening 


ee Prob. of non - happening 
Odds against an event ; | Diehrorhpeniie. b. of happening 

To find the no. of ways of getting a certain sum in rolling dice. 

No. of ways of getting a sum ‘r’ in rolling ‘n’ f-faced dice 

= co-efficient of x" in(x+ x7+...xf)" 

Ex. 7-1. The chance of an event happening is the square of the chance of a second 
event but the odds against the first are the cube of the odds against the second. Find the 
chance of each. : 


Sol. Let p and p’ be the chances of happening of two events. 


Then — p= p” 
= 
Odds against the first event = ~~ 
l-p’ 
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i 
which implies p’ = 3 


Ex. 7-2. In an experimer 

twice as likely as w; (j =1,2. 
Sol. Let J 
Then | 


Now, 


Since, w,,...w, are exha 
Pit P2 


Pi (i + 2+..6 


1 
Pil 


Ex. 7-3. The sum of twe 
product of two quantities is n 
Sol. Let y be one quantit 


Then (2n - x) is the oth 
Let 
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which implies p’ = 3 


Aad 
9 


Ex. 7-2. In an experiment there are n outcomes W,,W,...W,. The outcome Wj, is 


twice as likely as w; (j=1,2....n—1). Find P(A,) where. Ay =(W,,W9,-.Wy)- 


Sol. Let P(w;) = Pj j=l,2...n-1 
Then Pjur = 2P; 
Now, P2 = 2p, 


P3 = 2p, =2’p, 

Pa = 2p3=2?p, 

Pg = 20D, 
Since, w,,...w,, are exhaustive, 


Pit Pzt--Py = 1 
p, (1+ 2+..42"") =} 


r= \ 
1 =Il>p,= 
1-2 PES pe ay 


k 
P(A,) = > Pw) = Pi + Pate APE 


i=l 


Pi (1 + 2+...424 1) 


29% 
Pi 1-2 
_ 2F-1 
2” -1 
Ex. 7-3. The sum of two positive quantities is equal to 2n. Find the chance that the 
product of two quantities is not less than 3/4 times of their greatest product. 
Sol. Let y be one quantity. . 
Then (2n— x) is the other quantity 


Let y= x(2n-x) 
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dy 
a 2n-2x 
is maximum when id = 
- dx 
Leé., é : Qn-2x = 0 
ie, x= 


#4 Maximum value of y = n 
«. x is to be such that 


x(2n-x) ¢ =n 


ie, | 3n? —Bnx+4x? ¥ 0 


ie., ; (3n—2x)(n—-2x)# 0 
n 3n 
—<x<— 
2 2 
3n on 
.. No. of favourable cases = 7 - 2 =n 


and total no. of cases = 2n 


nl 
d. prob. = =~ =7. 
Reqd. pro on 2 
Ex. 7-4. What is the chance that (a) a leap year selected at random will contain 53 


' Sundays, (b) a non-leap year selected at random will contain 53 Sundays ? 


Sol. (a) Ina leap year there are 366 days i.e., 52 weeks and 2 days. Remaining two days 


can be any two days of the week. Different possibilities are : 


Sunday 
Monday 
Tuesday 
Wednesday 
Thursday 
Friday 
Saturday 


In order to have 53 Sundays, out of remaining two days one must be Sunday. 


and Monday 
and Tuesday 
and Wednesday 
and Thursday 
and Friday 

and Saturday 
and Sunday 


No. of cases favourable to the event of having one Sunday out of 2 days = 2 


Total number of cases = 7 


2 
_*. Reqd. prob. = 7 


(b) In anon-leap year there are 365 days i.e., 52 weeks and 1 days. Remaining 1 day can 


be any day of the week. 
. Total no. of cases = 7. 


There will be 53 Sundays if the remaining one day is Sunday. 


.. No. of favourable cases = 1 


1 
.. Reqd. . =r 
Reqd. pro 7 


PROBABILITY 


Ex. 7-5. Two cards are d 
chance of drawing two aces 
Sol. Total number of wat 


are 4 aces out of which 2 ace 


5 Reqd. prob. 


Ex. 7-6. From a pack of 
is a king and the other a que 


Sol. Total number of ca: 


Since in a pack there 4 } 


.. Reqd. prob. 


Ex. 7-7. Four cards are ¢ 


ms that they are from four diffe: 


Sol. Total number of ca: 


Since there is 13 cards 0 
suits 


Req 


Ex. 7-8. From a set of | 
the chance that 

(i) Its number is a muli 

(ii) Its number is multiz 

Sol. (i) Total number of 


The number on the card 
.. Number of favourab: 


Rec 
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Ex. 7-5. Two cards are drawn at random from a well-suffled pack of 52. Show that the 


1 
chance of drawing two aces is 1 


Sol. Total number of ways of drawing two cards out of 52 =**c,, Ina pack of 52, there 


are 4 aces out of which 2 aces can be drawn in 4c ways. 


4c, 
.. Reqd. prob. = 92 
C2 
eu zal, 
52-51 221 


Ex. 7-6. From a pack of 52 cards, two are drawn at random. Find the chance that one 
is a king and the other a queen. 


Sol. Total number of cases = **c, 


Since in a pack there 4 kings and 4 queens, number of favourable cases = 4c, i C). 


-. Reqd. prob. — = "52 


= 663 . 
Ex. 7-7. Four cards are drawn from a well shuffled pack of cards. What is the probability 
that they are from four different suits ? 


Sol. Total number of cases = **¢,. 


Since there is 13 cards of each suit, no of ways of drawing 4 cards belonging to different 


suits 
= (Pa)(Pa){a)(e)=13 


(13)* 2197 


Reqd. Prob. = 225, zi 20825 . 


Ex. 7-8. From a set of 17 cards numbered 1, 2,....17, one is drawn at random. What is 
the chance that 


(i) Its number is a multiple of 3 or 7? 
(ii) Its number is multiple of 3 or 5 or both? 


Sol. (i) Total number of cases = ne =17. 


The number on the card drawn will be a multiple of 3 or 7 if it is 3, 6, 7,9, 12, 14, or 15. 
.. Number of favourable cases = 7. 


7 
Read. prob. = 7 


190 MATHEMATICAL STATISTICS 


(ii) The number on the card drawn will be a multiple of 3 or 5 or both if it is 3, 5, 6, 9, 
10, 12 or 15. 
.. Number of favourable cases = 7. 


: 7 
“. Reqd. prob. = 7 


Ex. 7-9. (a) If n biscuits are distributed at random among N beggars, what is the 
chance that a particular beggar receives r(<n) biscuits ? ost 
Sol. Since one biscuit can be given in N ways, number of ways of distributing 1 biscuits 
among WN beggars = N”. ; a 
If one particular beggar receives r biscuits, remaining (n-r) biscuits are to be distributed 


among (N-1) beggars and this can be done in (N —1)""" ways. 


r biscuits to be given to one particular beggar can be chosen in "c, ways. 


.. Number of favourable cases = "¢,.(N —1)"" 


"co (N bas iy 
N" 

(b) What is the most probable number of biscuits distributed to a particular beggar? 

"Cc, (N-)"" 


n 


. Reqd. prob. = 


Sol. Let P(r) = 


Most probable number of biscuits distributed to a particular beggar is that value of r 
which is s.t. 


P(r-1)< P(r)2 P(r +1) 


Consider P(r-1I)< P(r) 
"c,(N-1)***! 2 "c (N~1)"" 
Lé., ee eS a 
N N 
' ! 
ie. ——————__. (N-1) ¢ ——_. 
? (r-D!(n-r+l)! ri(n—-r)! 
: N-1 wl 
oe n+r+l or 
i.e., . r(N-l) s(n4+))—-r 
n+l 
Lé., . r Sa 
Consider P(r) > P(r +1) 
No (N=1)"" "cy (N-1)""! 
i.e. eS Se 
» N" N" 
j n! n 
; ——. (N -1) > ——___—_—_ 
7 : Gene ) (r+Di(n-r-D! 
N-1 1° 
Lé., sae 2— 


n-r r+l 
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Lé., 


Le, 


.. Most probable value 


Since r is the integer, it 


n+l. . 
In case Ny 1s an integer, i 


Ex. 7-10. Two different 
the probability, that the sum 


their sum will exceed 13 eac. 


J is 
is 78 


Sol. Total number of wz 
(2) Different possibilitie 


Number of favourable c 
. Prob. of choosing 2 ¢ 


(ii) Different possibilitic 


Number of favourable c: 
.. Probability of choosi 


(iii) There are only thre 
chosen in +85 ways. 


.. Number of favouabli 


3 
“. Reqd. Breb, OR. 


Ex. 7-11. /f four squares 
they should be in a diagonal 
Sol. A chess board is a si 
Diagonal BD divides the 
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Leé., r(N-1I)+(N-l2n-r 
n+l 
i.e. r>——-] 
ie., N 
.. Most probable value of r is such that 
n+1 n+1 
—-l<r<— 
N 


; : : a ntl. ntl. ‘ 
Since r is the integer, it is the greatest integer less than Sie if —— is not an integer. 


n+l 
d —--1l. 
an N 


Ex. 7-10. Two different digits are chosen at random from the set 1, 2, 3,.....8. Show that 
the probability, that the sum of the digits will be equal to 5 is the same as the probability that 


n+l. ’ n+l 
In case yy Cisan integer, r can take both values —— 


1 
their sum will exceed 13 each being i Also show that the chance of both digits exceeding 


5 poe 
iS 38" 


Sol. Total number of ways of choosing 2 digits = Be, = 28. 
(i) Different possibilities of getting 2 digits with sum 5 are : 


Ist-digit 2nd-digit 
1 4 
2 3 


Number of favourable cases = 2 
. Prob. of choosing 2 digits with sum 5 = — 


(ii) Different possibilities of getting 2 digits with sum exceeding 13 are : 


Ist-digit 2nd-digit 
6 8 
7 8 


Number of favourable cases = 2 
.. Probability of choosing 2 digits with sum exceeding 13 
atiate 
14° 
(iii) There are only three digits exceeding 5 and out of these three digits two can be 
chosen in 3¢, ways. 


.. Number of favourable cases =? 


€2 = 3. 
3 
“. Reqd. prob; = 38 


Ex. 7-11. If ‘four squares are chosen at random on a CHESSBOGG, find the chance that 
they should be in a diagonal line. 

Sol. A chess board is a square divided into 64 equal squares parallel to the sides. 

Diagonal BD divides the board in two equal As ABD and CBD. In AABD, four squares 
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along a diagonal line can be chosen along L,M,,L,M2,L3;M3, L4M, or DB which contain 


respectively 4, 5, 6, 7, 8 squares. 


. Number of ways of selecting 4 squares in A ABD = 4c, +c, 6; +c, +c, 

This is also the number of ways of selecting 4 squares in each of As BCD, ACD, and 
ABC. 

. Total number of ways of selecting 4 squares along a diagonal line in the square 


ABCD = afc, $e, 4, ef cy} +2. wor 


= 364 
(This is because the diagonals BD and AC are common to As ABD, BCD and ACD, ° 
ABC). 
Also total number of ways of selecting 4 squares out of 64 
= Wes 
36413 
«. Reqd. prob. = 64, ~ 92692 ° 


Ex. 7-12. A five-figure number is formed by the digit 0, 1, 2, 3, 4 (without repetition). 
Find the prob. that the number formed is divisible by 4. 
Sol. Total number of ways of arranging digits 0, 1, 2, 3, 4 
=5!=120! 
If a number start with ‘0’ the remaining 4 digits can be arranged in 4!=24 ways and 
hence total number of five-figure numbers = 120 - 24 = 96. . 
Now the numbers ending with 04, 20, 40, 12, 24 and 32 are divisible by 4. 
No. of numbers ending with 04 = Total number of ways of arranging digits 1, 2, 3 
=3!=6. 
Evidently this is also the number of numbers ending with 20 and 40 respectively. 
For the numbers ending with 12, out of total number of ways of arranging remaining 
three digits 0, 3, 4 the number of ways in which ‘0’ occurs first are to be discarded. 
(.. these will give four-figure numbers). 
. No. of numbers ending with 12 = 3!— 2! =4. ; 
Evidently this is also the number of numbers ending with 24 and 32 respectively. 
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“. Total number of five 
divisible by 4 


. Reqd. prob. 
Ex. 7-13. Out of (2n+ 
Find the chance that the nu 


Sol. Total number of cz 


Different possibilities « 
1 


. 


awn tm K t.d AD 


and so on. 
Number of terms in 1s! 
Number of terms in 2n 
Number of terms in 31 
Number of terms in 4tl 
Number of terms in 5tl 
and so on. 
.. No. of favourable c: 


“. Reqd. prob. 


Ex. 7-14. Four cards 
probabilities of the followi 
(a) The cards are of tl 
etc.) 

(b) There is at least or 

(c) Only two of the fot 


Sol. (a) Total number 
First card (say A, ) ca: 
the 36 cards obtained on di 
the third card (say A, ) mu: 
to the suits and denominati 


obtained on discarding the 


.. No. of favourable ¢ 
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Cc 
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2, 3,.4 (without repetition). 
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.. Total number of five-figure numbers, formed by the digits 0, 1, 2, 3, 4, which are 


divjsible by 4 
= 3(6+4) = 30 
-. Reqd. prob -2.2 
. Reqd. prob. Se ic: 


Ex. 7-13. Out of (2n+1) tickets consecutively numbered, three are drawn at random. 


Find the chance that the numbers on them are in A.P. 


Sol. Total number of cases = 


2n+1 


C3. 


Different possibilities of drawing tickets with their numbers in A.P. are : 


and so on. 
“. No. of favourable cases 


. Reqd. prob. 


n 


1525:35°1,. 3,555 cescsadeiiean 1, (n+ 1), (2n+ 1) 
2,3,4;52,4,6; ee 2,(n+1), 2n 
He oho bees Pes ft eee 3, (n+ 2), (2n + 1) 
4,5,6;4,6,8; ee 4,(n+2),  2n 
55:6, 725 6h Oss — “sdbedeessties 5, (n +3), (2n + 1) 
and so on. 

Number of terms in Ist sequence n 

Number of terms in 2nd sequence = (n—1) 

Number of terms in 3rd sequence = (n- 1) 

Number of terms in 4th sequence = (n-2) 

Number of terms in Sth sequence = (n-2) 


2nt+le, ~ 4n? -1 


Ex. 7-14. Four cards are drawn out at random from a full deck of 52. Find the 
probabilities of the following contingencies. 
(a) The cards are of the four different suits and of different denominations (1, 2, king 


etc.) 


(b) There is at least one ace-card. 
(c) Only two of the four suits are represented. 


Sol. (a) Total number of ways for taking out 4 card 


s= 2 


4: 


First card (say A, ) can be any out of 52. Then the second card (say A, ) must be from 


the 36 cards obtained on discarding the cards belonging to the suit and denomination of A, 


the third card (say A, ) must be from the 22 cards obtained on discarding the cards belonging 


to the suits and denominations of A, and A, and the fourth card must be from the 10 cards 


obtained on discarding the cards belonging to the suits and denominations of A,, A, and A,. 


52, 36, 22. 10 
Ce ae 
. No. of favourable cases = ( PSAs al ! V 
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(. Four cards can be arranged among themselves in 4! ways). 


52 36, 22 10 
Cc) C\ Ce cy =0-06 


20, -4! 
(5) and (c) are left as exercises. 


Ex. 7-15. Out of 3n consecutive numbers 3 are selected at random. Find the chance 
that their sum is divisible by 3. 


“. Reqd. prob. = 


Sol. Let 31 consecutive numbers be p+l,.....p+3n where p is any integer. These 3n 
numbers can be arranged as follows: 


...(A) 


pt+3n-2 p+3n-1 p+3n 
Numbers in three columns have the property that the sum of any three numbers in any 
particular column is divisible by 3. Now each column consists of n numbers and hence 
number of ways of selecting 3 numbers from any one particular column ="c,, 


Also the numbers in (A) are s.t., if three numbers are chosen one from each column, 
their sum is divisible by 3. 


Now number of ways of selecting three numbers from (4) one from each column = n°. 
.. No. of favourable cases 


= 1343." cy 


n 2 
= —[3n* —3n+2]. 
Tok n+2] 


Also total number of cases 


=, = 5 (On -9n+2) 


3n* -3n+2 
9n* -9n+2° 
Ex. 7-16. If 6n tickets numbered 0, 1, 2,.......6n-1 are placed in a bag and three are 
drawn out, show that the chance that the sum of the numbers on them is equal to 6n is 
3n 
(6n-1) (6n—2)° 


.. Reqd. prob. = 


Sol. Total number of cases =°" C3. 
Different possibilities of drawing tickets with the sum of their numbers equal to 6n are : 
0,1,627-—-1, 0,2,6n-2; 0 wow. 7 0, 3n-1,324+1 
1,2,6n-3, 1,3,6n-4; wou. : 1, 3n—1,3n 
2,3,6n—5, 2,4,6n-~6; cucu. : 2, 3n—2,3n 
3,4,6n-7, 3,5,6n-8; wu. ote 3, 3n — 2, 3n—-1) 
4,5,6n—9, 4,6,6n-10; ww. z 4, 3n-3,3n-1 


Tee omen ere cee eeenbncn sane recerenasecerenseuns seoeeeseeneenseseneness 


2n—2, 2n-1, 2n+3; 2n-2, 2, 2n+2 ; ° 2n-1, 2n, 2nt+1 


PROBABILITY 


No. of terms in first sequen 

No. of terms in 2nd sequen 

No. of terms in 3rd sequen 

No. of terms in 4th sequen 

No. of terms in Sth sequenc 
and so on. 

.. Total number of ways of 


= {(3n-1)+(3n -2)}+(( 
= {(3n-1)+(n-4)+(n- 


_n sat Rate 
= 5 (4432-Di45 (243 


3n2 
. Reqd. prob. = oe 
6n,, (€ 
Ex. 7-17. Four different ol 
marked 1, 2, 3, 4. What is the 
corresponding to its number ? 
Sol. Total number of ways « 
Number of ways in which a 
If three objects occupy their 
discussed above. 
If two objects occupy their 
other’s position i.e., in only one 
Since out of 4, two objects « 
two objects can occupy their pla 


If one object occupies its pi 
ways by occupying the positions 
Since out of 4, one object c: 
one object can occupy its place. 


.. Total number of ways in: 


_.. Total number of ways in v 
to its number 


“. Reqd. prob. 


Ex. 7-18. A and B stand in 
persons is at random, find the chi 
Sol. There are in-all 12 pers 
person remaining 11 persons can 
.. Total number of ways in \ 
Out of 10 persons three are 


'¢, ways and can be arranged i 
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i). 


‘andom. Find the chance 


is any integer. These 3” 


(A) 


any three numbers in any 
of n numbers and hence 


umn ="c,. 
1 one from each column, 


from each column = n3, 


1 in a bag and three are 
n them is equal to 6n is 


umbers equal to 6n are : 
1,32+1 

1,3” 

2, 3n 

2, Bn —-1) 

3,3n-1 


2n, 2n+1 
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No. of terms in first sequence = 32-1 
No. of terms in 2nd sequence = 2n- 2 


£ 


No. of terms in 3rd sequence ='5,, 4 

No. of terms in 4th sequence = 3 —5 

No. of terms in Sth sequence = 37, —7 
and so on. 


“. Total number of ways of drawing tickets with the sum of their numbers equal to 6n 
= {(3n-1)+(3n-2)}+{(3n- 4) +(3n-5)}+{(3n-7) +(3n— 8)}4+....4(2 +1) 
= {(3n-1)+(3n- 4) + (3n-7)+...42}4+{B3n- 2) + (3n-5)+(3n-8)+....4]} 

n 


= 5 (4+ 3n-Dh4 5 2+ Hn—D} = 3n? 


3n? 3n 
. Reqd. prob, =~ = ——"__. 
cee Ga. GRINGO) 


c3 


Ex. 7-17. Four different objects 1, 2, 3, 4 are distributed at random on four places 
marked 1, 2, 3, 4. What is the probability that none of the objects occupies the place 
corresponding to its number ? 

Sol. Total number of ways of distributing 4 objects on 4 places = 4!=24, 

Number of ways in which all the four objects can occupy their places = 1. 

If three objects occupy their places, 4th will also do so. So this is contained in possibility 
discussed above. 


If two objects occupy their places, remaining two can go wrong by occupying each 
other’s position i.e., in only one way. 
Since out of 4, two objects can be chosen in 4c, ways, number of ways in which only 
two objects can occupy their places. 
= 4c, x1=6. 
If one object occupies its position, any one of the remaining three can go wrong in 2 
ways by occupying the positions of other two. 
Since out of 4, one object can be chosen in 4c, ways, number of ways in which only 
one object can occupy its place. 
= 4c, x2=8. 
-. Total number of ways in which at least one object can occupy its place 


= 14+6+8=15 
_ #, Total number of ways in which none of the objects occupies the place corresponding 
to its number 


= 24-15=9 
gees = 
.. Reqd. prob. m4 8 


Ex. 7-18. A and B stand in a ring with 10 other persons. If the arrangement of 12 
persons is at random, find the chance that there are exactly three persons between A and B. 

Sol. There are in-ail 12 persons who are to stand in a ring. Fixing the position of one 
person remaining 11 persons can stand in a ring = 11! 

“. Total number of ways in which 12 persons can stand in a ring = 11! 

Out of 10 persons three are to stand between A and B. These three can be chosen in 


u C3, ways and can be arranged in 3! ways. 
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Also number of ways of arranging remaining 7 persons = 7! 
Since A and B can interchange their position, number of ways of having exactly 3 persons 


between A and B = 2% 6, 317 
= 23! 
= 2.10! 
.. Reqd. prob. = a = =a 
11! 11 


Ex. 7-19. The first 12 letters of the alphabet are written at random. Find the chance 
that there are exactly 4 letters between A and B. 
Sol. Total number of ways = 12! 
Different possibilities are : 
1 2 3 4 5 6 #7 8 9 10 11 12 
A é , é : B - : . : : é 
A R . : F B 


A F . : ‘ B : 
A : : ‘ : B : 
. . . : $ : A F . : . B 
Out of remaining 10 letters, 4 letters are to lie between A and B. These four can be 
chosen in 10c, ways and can be arranged in 4! ways. 
Also number of ways of acangig remaining 6 letters = 6! and A and B can interchange 


their positions. 
. Total number of ways of having exactly 4 letters between A and B. 


= 7.2. ¢,.41.6 
1 
age ate 
614! 
= 7.2.10! 


eer gee Mee ee 
“. Reqd. prob. 5, ce 
Ex. 7-20. If the letters of the word ‘REGULATIONS’ be arranged at random, what is 
the chance that there will be exactly 4 letters between the ‘R’ and the ‘E’ ? 
Sol. There are in all 11 letters to be arranged. 
Total number of ways of arranging 11 letters 
=i1! 
Different possibilities are : 
1 2 3 4 5 6 #7 8 9 10 11 
R ; ‘ ‘ < E i: 3 F . “ 
R : 2 - : E 


PROBABILITY 


Therefore, as in last exam 
the ‘R’ and the ‘E’ 


Therefore, reqd. prob. 


Ex. 7-21. Show that the cl 


Sol. There are six faces of 
.“. Total number of cases = 
Out of six faces, three are 
_ .. Number of favourable ¢ 


“. Reqd. prob. 


Ex. 7-22. In a single thro 
eleven. 


Sol. Total number of cases 
(i) The sum 8 can be obtai 
Fi 


.. The number of favourat 


. Reqd. prob. 


(ii) The sum 11 can be obti 
Fi 


“. Number of favourable c. 


“. Reqd. probability 


Ex. 7-23. Find the chance 
Sol. Consider the expressic 
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=7! 
iys of having exactly 3 persons 


1 at random. Find the chance 


B : 
B : 
: : B 
a A and B. These four can be 


5! and 4 and B can interchange 


veen A and B. 


2 arranged at random, what is 
2’ and the ‘E’ ? 


9 10 It 
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Therefore, as in last example, total number of ways of having exactly 4 letters between 


the ‘R’ and the ‘E’ 


Therefore, reqd. prob. 


1 
Ex. 7-21. Show that the chance of throwing an odd number with a die is 5 


Sol. There are six faces of a die marked with numbers from 1 to 6. 


.. Total number of cases = 6. 


Out of six faces, three are marked with odd numbers viz., 1,3 and 5, 


= 6.2.9 ¢4.41.5 
= 6.2.9! 
_ 629! 

IL! 

62 6 


~ THO 55° 


_ .. Number of favourable cases = 3. 


“. Reqd. prob. 


Ex. 7-22. In a single throw with two dice, find the chances of throwing (i) eight, (ii) 


eleven. 


Sol. Total number of cases = 6 x 6 = 36, 


(i) The sum 8 can be obtained in either of the following ways : 


First die 


6 
5 
4 
3 
2 


.. The number of favourable cases = 5. 


.. Reqd. prob. 


(ii) The sum 11 can be obtained in either of the following ways : 


5 


~ 36° 


First die 


6 


5 ‘ 
.. Number of favourable cases 


.. Reqd. probability 


Ex. 7-23. Find the chance of throwing a total of 3 or 5 or 11 with two dice. 


Sol. Consider the expression 


: 


tl 
2 


l-x 


Second die 
2 


3 
4 
5 
6 


Second die 


(etx 4 ax")? 


aa 
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6,2 -2 
x*(1-x°)?(1- x) = x7(142x43x7 442345 
Therefore, number of ways | 


x2(1—2x6 +x?) +20 43x" 44x34...) 


. Number of ways of getting a total of 3 
= co-efficient of x* = 2 
and number of ways of getti 


Number ot ways of getting a total of 5 


= co-efficient of x° =4 | 
Total number of cases 


and Number of ways of getting a total of 11 
Therefore, probability of gel 


= co-efficient of x!! =10-8=2 
-. Number of ways of getting a total of 3 or 5 or 11 


=2+4+2=8 
Total no. of cases = 67 = 36 
ee Ex. 7-26. Each co-efficient i; 
eee -£.2, an ordinary die. Find the prob. ti 
Sol. The roots of the given e. 
Ex. 7-24. Show that the chance of throwing 15 with 3 dice is —~5 708 pew various posebie values 
Since there are 3 co-efficient 


Sol. Consider the expression 


2 643 
ee : igcedieans 
Now various possibilities are 


3 (d=x)* “4 7 c 
(l-x)? 1 
= ;3 6 12 18 -3 2 1 
=x (l- 3x. +3x'* —x' )(l-x) 2 1 5 
= x3(1-3x° + 3x! — x8 (page eae Po cesustes got eee. T eseeee 1 3 
2! 3! r} ; 3 3 i 

Therefore, number of ways of getting a total of 15 
= co-efficient of x’ ‘ ; 4 
2 
4 3.4.5.6.789.10111213.14 _ 3, 3.45.6.78 re 4 1 

12! 6! 
, = 10. ; ‘5 5 
Total number of cases = 6° =216 1 
Redswub a 2 1 6 
.. Reqd. prob. = 316 108° 3 ; 
Ex. 7-25. Show that, in a single throw with two dice, the chance of throwing more than 6 3 2 
6 1 
7 is equal to that of throwing less than 7 each being ot 
1 
Sol. Consider the expression e) 1; ' 
(xt x2 t...4x°)? 

2 8 2 4 
eax 6)2¢1— x)? 4 2 


= x2(1— 2x6 +x! 14204327 44x34..4(r +1)x" +.. 3 
(Values 10, 11 etc, of ‘ac’ are not ¢ 
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43x + 4x3 +...) 


108° 


\a-x)? 


r! 


14, 345678 , 


—_ 6! 


hance of throwing more than 
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=x7(1+2x 43x? 4.429 +54 4625 45x54 4x7 43x28 4.99 +x'04....) 
Therefore, number of ways of getting a number less than 7 


= sum of co-efficients of x7, x3, x‘, x? and x® 


=14+2+344+5=15 
and number of ways of getting a number greater than 7 
=34+4+34+2+1=15 


Total number of cases = 6? = 36 
Therefore, probability of getting a number greater than 7 
= probability of getting a number less than 7 


Ex. 7-26. Each co-efficient in the equation ax? +bx +c =0 is determined by throwing 
an ordinary die. Find the prob. that the equation will have real roots. 


Sol. The roots of the given equation will be real if b? > 4ac. 


Now various possible values of co-efficients in the equation are 1, 2, 3, 4, 5, 6. 
Since there are 3 co-efficients, total number of cases 


= 6° = 216. 

Now various possibilities are : 
ac a c 4ac b No. of cases 
d 1 1 4 2,3,4, 5,6 5x1=5 

2 1 
2 i 2 8 3,4, 5,6 4k 2=8 
1 3 
3 3 1 12 4,5,6 3x2=6 
1 4 
4 : 2 16 4, 5,6 3x3=9 
4 1 
1 5 
5 {5 I 20 5,6 2x2=4 
1 6 
2 3 
6 3 2 24 5,6 2x4=8 
6 1 
1 7 This is not possible as on a die number 
7 7 1 greater than ‘6’ can’t occur. 
2 4 
8 4 2 32 6 1x2=2 
9 . 3 3 36 6 1x1=1 


(Values 10, 11 etc, of ‘ac’ are not possible as 4 x 36). 
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+. Total number of favourable cases 
= 43 
5 Sik ieee 2 Se 
. Reqd. prob. AIG 


Ex. 7-27. Four tickets marked 00, 01, 10, 11 respectively are placed in a bag. A ticket . 
is drawn at random five times, being replaced each time. Find the prob that the sum of the 
numbers on tickets thus drawn is 23. 

Sol. Number of favourable cases = co-efficient of xin 


(oo +x! 42" +x Sie, (txt +x 


ie, (d+x)5+x") 


2 4 10x2 + 5x4 +49) (1+ 5x +10x” 410x329 +5x4 +x) 


ie, (1+5x+10x 
«, Number of favourable cases = 100 


Total number of ways of drawing cards = 4° 


wate _ 100 _ 25 
. Reqd. prob: = "5 256. 


Ex. 7-28. An urn contains a white bails and b black balls. If a+ balls are drawn from 


this urn, find the probability that among them there will be exactly 0. white and B black 


balls. 
Sol. Total number of cases = 4+ 5Cai5- 


Number of ways of drawing © white balls 


= 4 
Co 


and number of ways of drawing B black balls 
b 


. Number of ways of having & white and black balls among (o + B) balls drawn 


= b 
= “cy. Cp 


cy ep 
~ atb , 


Ca +B 


. Reqd. prob. 
Ex. 7-29. A die is cast until a 6 appears. F ind the probability that it must be cast more 


than 5 times. 


" Sol. Reqd. prob. 
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a7. Notations 


(/) Capital letters A, B, € 
(2) (A), (B) ete., denote ti 
(3) (A), (B) etc., denote 


(4) (AB) denotes the simi 
(5) (A+B) denotes the ha 
(6) P(A), P(B) etc., dent 
(7) P(A/B) denotes the ca 
is known that the event B has 
7.3. Theorem of Total Proba 
It states that the probabilii 
events is the sum of the proba 


P(A, + Az 

where A,, Az.....A, are M.E. 
Sol. Let Aj, Ag,...... A 
P(A, + Ay 


Let Vbe the number of cas 
Out of these let 


no. of cases favourable to 


no. of cases favourable to 


Pre eee eee eer eer reer errr errr 


no. of cases favourable ta 


Since A), Aj,..... A,, ari 
distinct and no-overlapping. 
.. No. of cases which are 


of the events A,, Az,....8 4 


P(A, + A+... 


Now by def., 


P(A, + Ay+...44,) = 
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are placed in a bag. A ticket 
| the prob that the sum of the 


10x20 +5x4° +.) 


If a+B balls are drawn from 
exactly 0 white and B black 


among (a +f) balls drawn 


ability that it must be cast more 
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a Jd. Notations 


(/) Capital letters A, B, C etc., denote events. 

(2) (A), (B) ete., denote the happening of events A, B etc. 

(3) (A), (B) etc., denote the non-happenings of A, B etc. 

(4) (AB) denotes the simultaneous happening of A and B. 

(5) (A+B) denotes the happening of at least one of the events A and B. 

(6) P(A), P(B) etc., denote the probabilities of the happening of the events A, B etc. 


(7) P(A/B) denotes the conditional probability of the NGDD EIN of the event A when it 
is known that the event B has already happened. 
7.3. Theorem of Total Probability 

It states that the probability of the happening of any one of the several mutually exclusive 
events is the sum of the probabilities of the happening of separate events i.e., 


P(A, + Ap +....-+A,) = P(A,) + ne ree +P(A,) 
where A,, A,.....A, are M.E. events. 
Sol. Let Aj, Ag,........ A,, be n mutually exclusive events. Then we are to show that 


P(Ay + Agt...tAy) = P(A) + P(Ag tees P(A, ) 


Let Nbe the number of cases which are equally likely, mutually exclusive and exhaustive. 
Out of these let 


no. of cases favourable to A, = 1 


no. of cases favourable to A, =m, 


TOR eee u rene rene renee sen eee rece ceases eeeasereseeeeeneeees 


Cera ere reeeerenaaneeeesenevesec ser neeneeeaneeeeseeeess 


no. of cases favourable to A, =m, 


Since A), Aj,....... A, are mutually exclusive, the cases mj, m,......... m,, are quite 
distinct and no-overlapping. ; 

. No. of cases which are favourable to (A, + Aj +.......... +A,,) (i.e., occurrence of any 
of the events A, Ay,.....000. A,) 


= m +m,+...+m, 


m +My, +....+M, 


P(A, + A3+....+A,) = N 


m  m, m 
ttt 

N WN N 
Now by def., 


Ul 
g 
8 


P(A) 


. P(A, + Ay +...+A,) = P(A,) + P(Ag)+...+P(A,) 
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4.3.1. Generalization of the theorem of total probability for non-mutually exclusive 
events. 
Let Aj, Aq... A, be the events which are not mutually exclusive. Consider A, and 


A, two events. Two mutually exclusive and exhaustive forms in which A, can happen are 


(i) A, happens and A, does not happen i.e., (4,42) 

(ii) A, happens and A, also happens 7.e., (A,Ap) 

Let m, and m, be the number of cases favourable to (4,43) and (A,A,) respectively. 
Then number of cases favourable to (A,) =m +m, . 


EE 


P(A,) = 
non 
where n is the number of equally likely, mutually exclusive and exhaustive cases. 
P(A,) = P(A,Az) + P(A1A2) (1) 


Interchanging A and A, 
P(A,) = P(A,A,)+ P(A}A2) (2) 
Now the three mutually exclusive and exhaustive forms in which (A, + A) can happen 
are 
(i) A, happens and A, does not happen i.e., (A, A). 
(i) A, happens and Ay also happens ie., (A, A2). 
(iii) A, happens and A, does not happen i.e., (A, A). 
-. Asbefore, P(A, + Az) = P(A,Ay) + P(A1A2) + P(A) A2) ...(3) 
Subtracting (1) and (2) from (3) 
P(A, + Az)— P(A,)— P(A2) = -P(A\A2) 
P(A, + Ay) = P(A;) + P(Az)— P(A) AQ) .. (4) 


Now P(A, + Az + A3) = P(A) + Az + A3) 


P(A,) + P(Ay + A3)— P(A Ap + Ag) 

P(A,) + P(Ag +43) — P(Ay Ag + A1A3) 

P(A,) + {P(Az) + P(A3)~ P(A243)} 

= {P(A,Aq) + P(A,A3) — P(AA243)} 
{0 (A, 424,43) = (A, A243)} 


3 3 
= >1P(A))- >) PCA:A;) + (DP P(ArA2As) 


i=l ij=l 
i<j 
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.. In general, 


P(A, + Ap+ 


Remark 1 : If A), Ap... 
etc. 

“. P(A, + Ay 4+...+A,) = 

Remark 2 : (4) is call 
proved without using theore 


7.3.2. Additive Law of pro 
Let A, and A, be twoe 
be the number of cases favo’ 
Number of cases favou 
and umber of cases favour: 


.. Number of cases fav: 


"Le, 
.. Ifn be the total numb 


PUL 


Ex. 7-30. Show that P( 
Sol. In axiom (3) take 


Then 

«. Axiom (3) => P(@)= 
=> P(p) = 0. 

Remark : Taking A,,,; 
Axiom (3) > 

P(A, + Az+....A,) = Pl 
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ty for non-mutually exclusive 


y exclusive. Consider A, and 


1s in which A, can happen are 


Aq) and (A,A,) respectively. 


and exhaustive cases. 


(1) 


...(2) 
in which (A, + Az) can happen 


Ay) ...(3) 
9) 

ly) (4) 
— P(A, Az + A, A3) 


(A3)— P(A2A3)} 
143) - P(A\A,A3)} 
) = (A,A2A3)} 


A,)+(-1)*"! P(A) A243) 
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.. In general, 


| P(A; P(A,A; 
P(A, + A +.. eee o-dr ) 


ijel 
i<j 


n : 
DY PAV jg )ooeeeeeH(-1)"! P(A Ag. Ay) 
i<j<k=l 
i<j<k 
Remark 1 : If A,, Ay,........ A,, are mutually exclusive, P(A;A;)=0, P(A;A;A,) =0 
etc. 
*. P(A, +Az+...+A,) = P(A) + P(A,)+.. +P(A,) 


Remark 2 : (4) is called additive law of probability for two events. It can also be 
proved without using theorem of total probability as seen below : 


7.3.2. Additive Law of probability for the two events. 
Let A; and A, be two events (not necessarily mutually exclusive). Let my, my, and m, 
be the number of cases favourable to (A,),(A,) and (A,A,) respectively. Then 
Number of cases favourable to (A,A,) =m, —m; 
and number of cases favourable to (A,A,) = m, —m, 
. Number of cases favourable to (A, + A) are 
i.é., m, +m, —-m3. 
.. If be the total number of cases which are mutually exclusive and equally likely then 
+m,—-m 
REE ge tals ace 
n 
_ mH, Mp _ ms 
non n 
= P(A,) + P(Az) — P(A,A2) 
Ex. 7-30. Show that P(p)=0 
Sol. In axiom (3) take 


= A, =... () 
Then Ai +Aj+...A, +... 0 
“. Axiom (3) => P(@) = P(@) + P(g)+........ 


=> P(g) =0. 


Remark : Taking A,,, = 9, A,,2 =@, 
Axiom (3) => 
P(A, + A, +....A,) = P(A,)+....+P(A,,) for any positive integer 7. 
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7.3.3. Using axiomatic approach show that 
P(AUB) = P(A) + P(B) — P(AB) 


Sol. We have 

A=ABUAB 

and ABO AB =@ 
«. By axiom (3) 

_ P(A) = P(AB) + P(AB) 

=> P(AB) = P(A)— P(AB) (1) 
Also AUB=BUAB 
and . BONAB=9 


P(A U B) = P(B)+ P(AB) 
= P(B) + P(A)- P(AB). 

Ex. 7-31. Show that P(A) =1- P(A) 
Sol. We have 

AUVA&S and AA ee 

P(S) = P(A) + P(A) 
=> 1  =P(A)+P(A) 
=>  P(A)=1-P(A) 


1 - | 
Ex, 7-32. [f P(A) = rl P(B)= ye ean A and B be disjoint. Explain : 


Sol. Let if possible, 
A and B be disjoint 


=>ANB=9 
=> ACB 
=> P(A) < P(B) , 


which is not true. 
.. A, B can’t be disjoint. 
Ex. 7-33. Prove or disprove : 


If P(A) = P(B) then 4 =B 
Sol. Consider sample space 
S = {e,,€2, €3, €4, €s} 
Where P(e,)=0:2 (G=1,2....... 5) 
Let A =(e3), B= {e), €3, €3,&5) 
Then B = {e,} 


PROBABILITY 


”. P(A) = P(B)=0-2 
We have A = {e,, €3,€4,¢ 
Ex. 7-34, Prove or dispre 
If P(A) =0, P(AB) =0 
Sol. ABCA 

= P(AB) < P(A) =0 

= P(AB)=0 

Ex. 7-35. Prove or dispro 
If P(A) = P(B)= p, then 
Sol. Consider a sample st 


where P(e,) = 0.15, P(e.) 

P(e,) =0.2 = Ples 
Consider A = (e,, ei, €4), 
P(A) =0-15+0-14+0:2 
P(B) =0:15+0-14+0:2 
“ p= P(A) = P(B)=0-« 
AB = {e,,e3} 


*. P(AB)=0-15+0-1=( 
Ex. 7-36. Out of a group 


different birthdays (assume 36 


Sol. Req. 


where (3: 
Ex. 7-37. 1f ACB then . 
Sol. Since Ac B,AB=A 
Also B= BAU BA 
= AUBA 
=> P(B) = P(A) + P(BA) 
=> P(B)> P(A) 
Ex. 7-38. [f P(A) = 0 ana 


P(AB) 21-4 
Sol. We have 
P(A! 
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.. P(A) = P(B)=0-2 . 
) We have A = {e;, €3, €4, €s5} # B= {e, €3, €3, €5}. 
Ex. 7-34. Prove or disprove : 
If P(A) =0, P(AB) =0 


Sol. ABCA 

=> P(AB) < P(A) =0 

=> P(AB)=0 

A) Ex. 7-35. Prove or disprove : 

If P(A) = P(B)= p, then P(AB)< p” 

Sol. Consider a sample space 
| S = {e), €5, €3,€4, €5} 
| where P(e,) = 0.15, P(e) = 0.35, P(e3) = 0-1 
} P(e,) = 0.2 = Ples) 

Consider A = (ened B= {e), €3, és} 

P(A) =0-15+0-14+0:2=0-45 
| P(B) =0-15+0-1+0-2=0-45 

“. p= P(A) = P(B) =0-45 

AB = {e,, e3} 

“. P(AB) =0-15+0-1= 0-25 ¢ (0-45)? = 0-2025. 

Ex. 7-36. Out of a group of 25 persons, what is the probability that all 25 will have 

pint. Explain : different birthdays (assume 365 days in a year and all days equally likely). 
\ Sol. Req. prob. = CO as 
(365) 


where (365)3, = (365).(365—1)...... (365 ~25~1). 
Ex. 7-37. If AC B then P(A) < P(B) 
Sol. Since AC B, AB=A 


Also B= BAU BA 


= AUBA 
=> P(B)=P(A)+P(BA) (*ANBA=@) 
=> P(B)> P(A) (- P(BA) 2 0) / 
Ex. 7-38. /f P(A) = a and P(B) = 8 then 
é P(AB)>1-a-~B. 
Sol. We have 


P(AU B) = P(A)+P(B)— P(AB) 
{1 — P(A)} + {1- P(B)}- P(AB) 
= 2-0-B-P(AB) 
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Now P(AYU B)sl1 
2-a-B-P(AB)<1 
=> P(AB) 21-a-B. 
Ex. 7-39. [f AB=0, show that 
P(A) < P(B). 
Sol. We have 
P(A) = P(AB) + P(AB) 
Since AB = 9, P(AB) =0 
P(A) = P(AB) = P(B)P(A/ B) 
<P(B) (:P(A/B)<]I}. 

7.4. Independent Events Def. 


Two events are said to be independent (in probability sense) if the probability of 
happening of one does not depend on the happening or non-happening of the other. 


741. Theorem of Compound Probability. 

If states that the probability of the simultaneous occurrence of two non-mutually exclusive 
events is equal to the prbability of hapening of one multiplied by the conditional probability 
of the other when it is known that first has already happened 

P(A) P(B/ A) 

i.e., P(AB) = or 

P(B) P(A/B) 

Proof. Let A and B be two non-mutually exclusive events. 

Two mutually exclusive and exhaustive forms in which A can happen are: 

(1) A happens and B does not happen i.e., (AB). 

(2) A happens and B also happens i.e., (AB). 

Let m, and m, be the number of cases favourable to (AB) and (AB) respectively and 
n the number of cases which are equally likely, mutually exclusive and exhaustive. 


Then number of cases favourable to (A) = m, +m, 


P(A) - m +m, 
n 

My 
Now by def., P(AB) = — 

n 

m +My, ms my 

= ——+ .—_+~— = P(A) - 
n m +m, -m, +m, 


Assuming the occurrence of A, out of n only (m, +m) cases are left, out of which m, 
are also favourable to B. 


me gives the conditional probability of B when it is given that A has occurred 
m + my, ~ . 


PROBABILITY 


Interchanging A and B 


Remark 1. If A and B ar 
the happening or non-happer 


P(BIA)=i 


P(AB) = P( 
Converse of this also ! 
independent. 
(2) Some authors use “ 
instead of independence. 
Ex. 7-40. If A and B are 


(i) A and B 

Sol. Since A and B dre i1 
P(AB) = P(A. 

(i) Now 


=> A, B are independen 

Similarly (ii), (ii), can b 

Ex. 7-41. If P(B)>0, s, 
P(p/ B)=0 


Proof. P(g / B)= so 


_ PQ) 

P(B) 
Ex. 7-42. If P(B) > 0, sh 
P(ATI 


Sol. P(A/L 
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y sense) if the probability of 
-happening of the other. 


e of two non-mutually exclusive 
| by the conditional probability 
d 


its. 
A can happen are: 


AB) and (AB) respectively and 
slusive and exhaustive. 


= P(A)» —2— 
my +My 


cases are left, out of which m, 


nit is given that 4 has occurred 
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—2 = P(B/ A) 
m +m 
o P(AB) = P(A)P(B/ A) 
Interchanging A and B 
P(AB) = P(B) P(A / B) 
Remark 1. If A and B are independent, prob of happening of B (or A) is not affected by 
the happening or non-happening of A (or B) 
P(B/ A) = P(B) and P(A/ B) = P(A) 
P(AB) = P(A)P(B) 
Converse of this also holds i.e., if this condition is satisfied events A and B are 
independent. 
(2) Some authors use “statistically independence” or “stochastically independence” 
instead of independence. 
Ex. 7-40. If A and B are independent then so are 
(i) Aand B (ii) A and B 
Sol. Since A and B are independent, we have 
P(AB) = P(A) P(B) 


(i) Now P(AB) 


(iii) A and B. 


P(A) - P(AB) 

P(A) - P(A)P(B) 
P(A){- P(B)} 
P(A)P(B) 


=> A,B are independent 

Similarly (ii), (iii), can be proved. 

Ex. 7-41. Jf P(B)>0, show that 
P(g / B)=0 


_ P(pB) 
Proof. P(p/ B)= “P(B) 


_- PO) _9 
P(B) 
Ex. 7-42. If P(B) > 0, show that 


P(A / B)=1- P(A/B) 


(.° P(g) = 0) 


~,»,_ P(AB) 
Sol. EAB) aay. . -(1) 
Also P(B) = P(AB) + P(AB) 


=> P(AB) = P(B)—- P(AB) 


=1-P(A/B) 
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Ex. 7-43. Prove or disprove: pairwise independence of events does not imply 
independence. 
Sol. Consider an experiment of tossing of two cubical dice. 


Let A,: odd face on first die 
A,: odd face on second die 


A;: sum of numbers on faces of two dice is odd. 


For A;, possibilities are : 


(1, 2), (1, 4), (1, 6), (2, 1), (2, 3), (2; 5) 
(3, 2), (3, 4), (3, 6), (4, 1), (4, 3), (4, 5) 
(5, 2), (5, 4), (5, 6), (6, 1), (6, 3), (6, 5) 


181 


For A, 7A, possibilities are : 


(1, 1), (1, 3), (1, 5), 3, 1), G, 3), B, 5) 
(5, 1), (5, 3), ©, 5) 


+, P(A, 0 Ag) = =i- P(A,) P(Ag) 


For A, © A3, possibilities are : 
(1, 2), (1,4), (1, 6), (3, 2), (3, 4)» 3, 6), (5; 2), (5, 4), (5, 6) 


PARA ; = P(A,) P(A3) 


36 
Similarly, 
P(A, OA joo ee PA )P(A3) 
2 36° 4 2 3 


.(Ay3 Ag); (A: Az); (A233) are independent. 
For A, A, 1A; there is no possibility. — 
te P(A, VAz 0g) = 0 # PCA) )PCAz) PCA) 
7.4.2. 1f P(B)>0 and Aj), Az... A, are MLE. events, show that 
P(A, U Ay U...U A,B) = P(A, | B)+..+P(A, /B) 


PA(AY..VA,) BY 


Sol. P(A, V..VA, / B) = P(B) 


_ P(A, Bv......UA,B) 
P(B) 


: 3 P(A,B) 


P(B) {. A; B's are also M.E.} 


i=1 


PROBABILITY 


7.4.3. If P (B) > Oand # 
P(A, UA, / B) = P(A, / 


Sol. P(A, U 


Ex. 7.44. Given P(A) > 


F 
show that 
Sol. We have 
F 
and Ff 
=> 


7.4.4. If By, Baye E 


and = =©6-«~P(B,) > 0 


then for any event A, 


Proof. We have 
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of events does not imply 


(5, 4), G, 6) 


, Show that 
P(A, / B) 


{. A; B's are also M.E.} 
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= DPA! BD. 


7.4.3. If P (B) > 0 and A,, Ay any two events show that 
P(A, U A, / B) = P(A, / B) + P(A, / B)— P(A\A / B) 


P{A, U Az )B} 
Sol. P(A, U Az! B) = ~~ pe 
P(A,BU ApB) 
P(B) 
P(A,B)+ P(A,B) — P(A, A2B) 
P(B) 

P(A, B) Fe P(A,B) _ P(A, A,B) 

P(B) P(B) P(B) 
P(A, / B) + P(A, / B)— P(A, A, / B) 
Ex. 7.44. Given P(A) >0, P(B) > 0 and 

P(A/ B) = P(B/ A) 


show that P(A) = P(B). 
Sol. We have 
P(AB) 
P(A/ B) = PCB) 
P(AB) 
and P(B/ A) = P(A) 
P(AB)  P(AB) 
P(A) —~P(B) 
= P(A) = P(B) 
7.4.4. If By, By... B,, are M.E. events s.t. 
- VB, 
S= 7 
and P(B,;)>0 (j =1,2......n) 
then for any event A, 
P(A) = EP(A/B,)PCB)) 
Proof. We have 
S= UB, 
j 
A=AS 
AUB 


210 MATHEMATICAL STATISTICS 


Also AB ;'s are M.E. 


P(A) = 2 PCAB,) 
= = P(A/B;)P(B;). 
, j 
Remark : This theorem remains true if n is infinite. 
Cor : Since 
BUB=S and P(B)>0,P(B)>0 
we have 


P(A) = P(A/ B)P(B)+ P(A/ B)P(B). 
Ex. 7-45. Under what conditions does the following equality hold : 
P(A) = P(A/ B)+ P(A/B) 


Sol. We have 
P(A) = P(A/ B)P(B) + P(A/ B)P(B) 
and by given 
P(A) = P(A/ B)+ P(A/B) 
Subtracting 
0 = P(A/ B){l- P(B)}+ P(A/B){l- P(B)} (1) 
since 1>P(B)>Oand1>P(B)>0 
we have 
1-P(B)>0 and 1-P(B)>0 
: (1) => 


P(A/ B) =0= P(A/B) 
=> P(AB)=0 and P(AB) =0 
=> P(A) = P(AB)+ P(AB) =0 

=> A=@. 
Ex. 7-46. Jf A and B are two events and the probability P(B) #1, prove that 
P(A/ B) = {P(A) — P(AB)}/ {1- P(B)} 

Hence show that 

P(AB) > P(A) + P(B)~1. 
Sol. By compound prob. theorem 

P(AB) = P(B)P(A/B) 


=, _ P(AB) 
= P(A/B)= PE) (1) 
Now P(A) = P(AB)+P(AB) 
=> P(AB) = P(A)-(AB) 


Also ~ P(B) =1- P(B) 


PROBABILITY 


Ex. 7-47. Three fair dice are 
the probability that 
(i) the sum of the faces is } 
(ii) one face is an ace. 
Sol. Let E : No two die sho 
S : Sum of the face 
P(E) = 6.54 _ 5 
666 9 
(2) To find P(S=7O E): 
Different possibilities are : 
(1, 2, 4); (1, 4, 2); (2, 1, 4); 


PS=7OE)= 28, 
21 
P(SS=70E) 
P(E) 
(ii) Let A : one face is an aci 
To find P(AN E): 


If ace comes on first die, dif 
(i, 2, 3); (1, 3, 4). 
(1, 3, 2); (1, 4, 3); 
But there are three dice and 


S(S=7/E)= 


. No. of possibilities = 3 x 


“PARR Se 
216 9 

“ PADS Oe 
P(E) 


‘ 


Ex. 7-48. A die is thrown as. 
turn up at the first throw, what is th 


Sol. Let £, : Event that 6 dc 


E, : more than fow 
throws). 


P(E, E,) = Prob. that 6 does 
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hold: 


(1) 


) #1, prove that 


(1) 
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P(A)— P(AB) 


(1) >. P(A/B)= 1 PB) (2) 


Now P(A/B)<1 
(2)=> P(A)- P(AB) <1-P(B) 
=> P(AB) 2 P(A)+P(B)-1L. 
Ex. 7-47. Three fair dice are thrown once. Given that no two show the same face. Find 
the probability that 
(i) the sum of the faces is 7 
(ii) one face is an ace. 


Sol. Let E : No two die show the same face 
S : Sum of the faces. 


P(E) = 654 5 
666 9 


(i) To find P(S=70E): 


Different possibilities are : 
(1, 2, 4); (1, 4, 2); (2, 1, 4); (2, 4 1): (4, 1,25 (42, 1) 


AsetAnee ee 

216 36 
55 <7/ 8) 2 PSZTOB) 1/36 1 
P(E) 579 20 


(ii) Let A : one face is an ace. 
To find P(A E): 
If ace comes on first die, different possibilities are : 
(1, 2, 3); (1, 3, 4); C1, 4, 5); (1, 5, 6) 
(1, 3, 2); (1, 4, 3); C1, 5, 4); (1, 6, 5) 
But there are three dice and ace can occur on any dice. 
“. No. of possibilities = 3 x 8 = 24. 


Paha se 
216 9 
. PA Se eet 
P(E) 9 5 


5/9 


Ex. 7-48. A die is thrown as long as necessary for a 6 to turn up. Given that 6 does not 
turn up at the first throw, what is the probability that more than four throws will be necessary ? 


Sol. Let E, : Event that 6 does not tum at the first throw. 


E, : more than four throws are necessary (i.e., 6 does not turn up in first four 


P(E,)= %. 


P(E,E,) = Prob. that 6 does not turn up in first four throws. 


throws). 
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4 
=(%) 
P(E, 1B) = 72) =(%6) -=. 


Ex. 7-49. Given P(A)=0-5 and P(AU B)=0-6, find P(B) if: 
(i) A; B are mutually exclusive. 
(ii) A; B are independent. 


' (iii) P(A/ B)=0-4. 


Sol. (i) P(A B)=0 as A, Bare ME. 
“. P(AU B) = P(A) + P(B) 
> 0-6 = 05 + P(B) 


P(B)=0-1 
(ii) P(A B) = P(A)P(B), as A; B are independent 


(0:5) P(B) 

Also P(AUB) = P(A)+P(B)- P(A B) 
0:6 = 0-5+ P(B) {1-0-5} 
0:1 = P(B)(0-5) 


p(py = Phat 20-2 
= 0575 
(iii) P(A B) = P(B)P(A/B) =0-4P(B) 


P(AUB) = P(A)+P(B)- (ANB) 
0:6 = 0-5 + P(B) {1-0-4} 


1 
Ex. 7-50. If A and B are independent and P(A) = P(B/ A) = 3° 


find P(AU B). 
Sol. Since A and B are independent, 


1 
P(B) = BE) Sa 


I iol 
P(AB) = P(A)P(B) = 5X5 = 
P(AU B) = P(A)+ P(B)- P(AB) 
i 1 1 
ara. 


1 
Ex. 7-51. /f P(A) = P(B) = P(B/A)= 3° are A, B independent ? 


PROBABILITY 


Sol. P( 


.. A, B are independent. 
Ex. 7-52. [fA and B are in 


Sol. P(AB VU, 


Ex. 7-53. /f P(B) = P(A/B 


Sol. PC. 


P(AI 


Ex. 7-54. Suppose B,, By, B 


(j =1, 2,3). Find P(A). 
Sol. P(AL 


As Bs are M.E., 
P( 


Ex. 7-55. Prove the followin 
(i) if P(A/ B)> P(A) then 
(ii) if P(B/ A) = P(B/A), 

(iii) if P(A)=a, P(B)=b, 
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P(B) if: 


independent 


3) 


B) 
B) 


— 


rendent ? 


PROBABILITY 
Sol. P(AB) = P(A)P(B/A) 
oe i ae 
pg ee = P(A) P(B 
So a (A) P(B) 


. A, B are independent. 
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1 ai, Sad 
Ex. 7-52. [fA and B are independent and P(A) = P(B) = ai find P(AB U AB). 


Sol. P(AB U AB) = P(AB)+P(AB) 


Ex. 7-53. If P(B) = P(A/B) 


(j =1, 2,3). Find P(A). 


As Bs are M.E., 


P(A)P(B) + P(A)P(B) 


1 
= P(C/ AB) = 3° find P(ABC). 


Sol. P(AB) = P(B)P(A/ B) =— 


P(ABC) = P(AB)P(C/ AB) 


Sol. P(AB)) = P(B;)P(A/B;) 


P(A) = P(AB,)+ P(AB,)+ P(AB3) 


Mh. 25S 3t 
—+—+—=-. 
18 18 18 3 


Ex. 7-55. Prove the following : 

(i) if P(A/ B)> P(A) then P(B/ A) > P(B) 

(ii) if P(B/ A) = P(B/A), then A and B are independent 
(iii) if P(A)=a, P(B)=b, then 


PAI B) > 97° =, : 


(. AB, AB are M.E.) 


1 ’ 
Ex. 7-54. Suppose B,, B,, B3 are mutually exclusive. If P(B;) = 5 and P(A/B;)= ‘ 


a 
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Sol. (i) P(A/B)P(B) = P(AB) = P(A)P(B/ A) 
P(AIB) _ P(B/A). , 
P(A) P(B) 
=~ P(B/ A)> P(B) 
(ii) By given © 
P(B/A) = P(BIA) 
P(BA) _ P(AB) 
= P(A) P(A) 
> . P(A)P(BA) = P(AB){l- P(A)} 
=> . P(A){P(BA)+ P(AB)} = P(AB) 
=> P(A) P(B) = P(AB) 


{. P(BA) + P(AB) = P(B)} 

. A, B are independent. 

(iii) By additive law, 
P(AU B) = P(A) + P(B) — P(AB) 
also P(AVU B)s1 

P(A) + P(B)- P(AB) <1. 
P(AB)2a+b-1 


an P(B) P(A/B)2a+b-1 
= P(A/B) 29°. 


Ex. 7-56. A single die is tossed; then n coins are tossed, where n is the number on the 
die. Find the probability of exactly two heads. 
Sol. Possible values of 7 are : 
2, 3, 4, 5, 6 


1 
and probability of each is - 


2 3 4 5 \6 
1 2 l 3 l 4 1 5 1 6 1 
.. Reqd. prob. = :{ a5 cf a(3}] + als +° C4 3 +° Co ry 


fi 3.3 5 15 33 
= 4 ttl ++ Sper 
6|4 8 8 16 64) 128 
Ex. 7-57. Prove or disprove the following : 
if P(A) > P(B) then P(A/C) > P(B/C). 

Sol. Consider biased tetrahedron with faces a, b, c and d with respective probabilities 
0-1, 0-2, 0-2 and 0-5 ; 

A= {a,b,d} 

B = {b,c}, C={a,b,c} 


Define 


PROBABILITY 
then 
and 
Now A 
B 
P(A 
P(B 


Ex.-7-58, Let B,, Bp, 


P(B;})>0 and P(A/B;)=p,, 


Sol. Let P( 
Then P 
Also 
P(A 
P(AT. 


MATHEMATICAL STATISTICS PROBABILITY 2 - 
==  e re ee 


\) then *” P(A) = 0-14+0-2+0-5=0-8 
P(B) = 0-2+0-2=0-4 
a P(A) > P(B) 
and P(CY = 0-140-240-2=0-5 . 
Now ANC = {a,b}=> PANC)=0-140-2=0-3 
BOC = {b,c}=> P(BAC)=0-2+0-2=0-4 
PIANC) 0-3 
P(A/C) = "Re GS 
_ P(BAC) 0-4— 
P(BIC) = PC) 05778 
P(AIC) + P(BIC) 


{.: P(BA) + P(AB) = P(B)} 
Ex. 7-58. Let B,, By,........ B, be mutually disjoint and let B= (Ja... Suppose 
: = 


h | P(B;)>0 and P(AIB;)= p, j= 1Qoe... Show that P(A/B) = p. 
Sol. Let P(B;) = p; >0 
n n 
Then P(B) = 2? (B)=>oP; 
jel jel 
Also AB = (JAB; and AB; are mutually disjoint. 


' where n is the number on the oe 
n 

P(AB) = 2, PCAB)). 
j=l, 


z > P(B;) P(A/B,) 
j 


told) o(')| - Lond Sp 


iti “ P(A/B = p. 
d with respective probabilities (A/B) P(B) 
: Bel 
j 
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Ex. 7-59. It is given that 
1 + AQ 5° eee ra) 


where P(A) stand for the probability that A, does not happen. Determine P(A,) and 


P(A,). Hence show that the events A, and A, are independent. 
Sol. Since total probability is always unity, 
P(A,)+ P(A3) = 1 


P(A,) = 1- P(A,) 
ees 
2. 2 


By additive law for non-mutually exclusive events, 
P(A, + Ay) = P(A;)+ P(A) — P(A\A2) 


5 
2 
P(A,) = 3 
21 1 
PCA, )P(Ag) = Fg = = PCALAd) 


.. Events A, and A, are independent. 
Ex. 7-60. Discuss and criticise the following : 


2 1 1 
P(A) = 7 P(B) = a P(C)= 6 


where A, B and C are mutually exclusive events. 
Sol. Since three events A, B and C are mutually exclusive, by total probability theorem 


P(A+B+C) = P(A)+ P(B)+ P(C) 


2. Fd 
sto 4+ — 
3 4 6 
is 
12 


Since the probability is always less than unity, the statement is wrong. 

Ex. 7-61. Two packs of cards are made-up in such a way that the first pack consists of 
39 red cards and 13 black cards; second pack consists of 39 black cards and 13 red cards. 
A sampling experiment is carried out in the following way: A card is drawn from the first 
pack, if it is red, a second card is drawn from the same pack after replacing the first red 
card. The colour of the second card drawn from the first pack is noted. If the first card 
drawn from the first pack is black, then the second card is drawn from the second pack and 
the colour of the second card is noted. Both the cards are then replaced in their respective 
packs. What is the probability that the second card is red ? 

Sol. Two different possibilities are : 

(1) First card drawn is red. 

(2) First card drawn is black. 


PROBABILITY 


(1) Now the probability | 


Since the first card drawi 


but after replacing the first re 


.. Probability of drawing 


This is the conditional 


known that card drawn in firs 


. By the theorem of com 


and second draws. 


(2) The probability of dr: 


Since the first card drawn 
. Conditional probabilit 


drawn in first draw. 


.. Probability of drawing 


.. Since two possibilities 


Req. prob. 
Ex. 7-62. From each of th 


What is the probability of thei 


Sol. Probability of selecti 


There are only two mutua 
(1) All the partners are of 
(2) All the partners are of 
By compound probability 


.. By theorem of total prc 
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PROBABILITY 
(1) Now the probability of drawing a red card from the first pack. 


39 
39 +13 


OP 3 
524 
Since the first card drawn is red, the second card is also to be drawn from the first pack 
but after replacing the first red card. 


. Probability of drawing a red card in second draw = — 


This is the conditional probability of drawing a red card in second draw when it is 
known that card drawn in first draw was red. 
‘. By the theorem of compound probability, the probability at drawing red cards in first 
and vecond draws. . 


aoe: 
4 4 16 
(2) The probability of drawing a black card from the first pack 
: lS uit 
39+13 52 4 


Since the first card drawn is black, the second card is to be drawn from the second pack. 
. Conditional probability of drawing a red card when it is known that a black card was 
drawn in first draw. 


13 


~ 30413 4 
. Probability of drawing a black card in first draw and a red card in second draw 


.. Since two possibilities are mutually exclusive, by the theorem of total probability. 


7 ca D2. 103 
“1616 16 8 

Ex. 7-62. From each of three married couples one of the partners is selected at random. 
What is the probability of their being all of one sex ? 

Sol. Probability of selecting a partner (male or female) from either couple 


Req. prob. 


t 
There are only two mutually exclusive possibilities : 
(1) All the partners are of male sex, 
(2) All the partners are of female sex. 
By compound probability theorem, probability of either possibility 


.. By theorem of total probability, 


4 
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eqd. prob. Tee go 
Ex. 7-63. In above question, show that the probability of choosing two men and one 
woman is ~- 
Sol. Three possibilities are : 
Ist couple 2nd couple 3rd couple 
M _M < W 
M Ww M 
W M M 
Probability of choosing a partner (male or female) from either couple 
1 


sib 

i ee: 

as 

: 8 i * 
By theorem of total probability, probability of choosing two men and one woman 

1 1 1 

=—+—+- 
8 8 8 

aed 


Ex. 7-64. A number x is chosen at random from the integers 1, 2, 3, ...n, and A and B 
denote the event that x is a multiple of 2 and 3 respectively. Show that A and B are independent 


events when n = 96 but not when n = 100. 
Sol. The no. of integers in 1, 2,....2, which are divisible by the integer m is the greatest 


n 
integer less than — 
m 


(i) When . n= 96 
No. of integers which are divisible by 2 


= 7% a ag 
2 


No. of integers which are divisible by 3 


ee) 
3 


No. of integers which are divisible by 3 and 2 both i.e., by 6 
222 i6 
6 


«. P(A) = Probability that x is a multiple of 2 


PROBABILITY 


P(B)= Probability that x 
and P(AB) = Probability tt 


«. P(AB) = P(A)P(B) 

.. Events A and B are inde 
(ii) When 

No. of integers which are ¢ 


No. of integers which are ¢ 


No. of integers which are d 


and P( 


is P( 
. Events A and B are not i 
Ex. 7-65. A bag contains 5 
replaced and then a second dra 
drawn were of different colours 
Sol. The two different poss 
(1) The first draw gives wh 
(2) The first draw gives ble 
Since the ball drawn in the { 
(white or black) in two draws ai 
Now the probability of dra’ 


and the probability of drawing < 


* 
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Vu 

7 f 

ther couple 


yssibility 


7o men and one woman 


rs 1, 2, 3,...n, and A and B 
that A and B are independent 


the integer m is the greatest 


PROBABILITY 219 


Seok 
9 2 
P(B) = Probability that x is a multiple of 3 
eccaees 
96 3 
and P(AB) = Probability that x is a multiple of 2 and 3 both i.e., 6 
ze EO 
96 6 


«. P(AB) = P(A)P(B) 
. Events A and B are independent. 


(ii) When n= 100 
No. of integers which are divisible by 2 
= 100 =50 
2 


No. of integers which are divisibledy 3 


100 
= greatest integer less than ae 33 


No. of integers which are divisible by 3 and 2 bothi.e., 6 


100 
= greatest integer less than e 16 


4) 100 2 

pipe 

3) 100 
and P(AB) = ~°- 
an (AB) 100 


se P(AB) # P(A)P(B) 

“. Events A and B are not independent. 

Ex. 7-65. A bag contains 5 white and 4 black balls. A ball is drawn from this bag and 
replaced and then a second draw of a ball is made. What is the probability that the two balls 
drawn were of different colours ? 

Sol. The two different possibilities are : 

(1) The first draw gives white ball and the second draw gives black ball. 

(2) The first draw gives black ball and the second draw gives white ball. 

Since the ball drawn in the first draw is replaced, the probabilities of drawing either ball 
(white or black) in two draws are same. , 

Now the probability of drawing a white ball 


5 


be Ste SP 
and the probability of drawing a black ball 
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.. By the theorem of compound probability, probability of possibility (1) 


5 4 20 
=—-xo Se 
9 9 81 
and probability of possibility (2) 
_4 <> _ 20 
"9 <9” 81 


By the theorem of total prowabulity, the probability of getting two balls of different 
salou: in two draws 
_ 20, 20 40 
merit rie 81 
Ex. 7-66. An urn contains 4 white and 5 black balls, a second urn contains 5 white and 
4 black balls. One ball is transferred from the first to second urn, then a ball is drawn from 
the second urn. What is the probability that it is white ? 
Sol. There are two different possibilities. 
(1) The ball transferred from the first to second urn is white. 
(2) The ball transferred from the first to second urn is black. 
The probability of drawing a white ball from second urn in these two possibilities will 
be different. So we consider these two possibilities separately. 
(1) Now probability of drawing a white ball from the first urn 


Since the ball transferred from first to second urn is white, total number of white balls in 
second run 


The number of balls in second urn 
=6+4=10 
.. Probability of drawing a white ball from second urn 
, eee 35, 
“10° 5 
*, By the dedi of compound probability, the probability of transferring a white ball 
and then drawing a white ball from the second urn 
eed 
~9 5 15 


De: 5 
9. 9 

Total number of white balls in second urn = 5 and number of balls in second urn = 10. 

.. Probability of drawing a white ball from the second urn 

ace 
10 2 

. The probability of transferring a black ball and then drawing a white ball from the 

second urn 


‘PROBABILITY 


The two possibilities (1° 
probability, the probability o 
white ball from the second. 


Ex. 7-67. Three urns co 
balls, 2 white and 2 black be 
then one from the latter is tra 
urn. What is the probability 

Sol. There are in all fou: 

(1) The white ball is trar 
transferred from the second 1 

(2) The white ball is trar 
is transferred from the secon 

(3) A black ball is trans: 
transferred from the second 1 

(4) A black ball is trans: 
transferred from the second t 

(1) Probability of drawit 


After transferring the wl 
Number of white balls i 
and total number of balls in t 
.. Probability of drawin 
ball from the first to the seco 


After transferring a whit 
Number of white ball in 
and total number of balls in t 
. The probability of dr 
ball from the second to the tk 


*. By the theorem of co 
ball from the first to the seco: 
drawing a white ball from the 


(2) After transferring wt 
balls in the second urn = 1. 

. The probability of dra 
white ball from the first to the 
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The two possibilities (1) and (2) are mutually exclusive. Hence by the theorem of total 
probability, the probability of transferring a ball from first urn to second and then drawing a 
white ball from the second. 

4 5 49 
= —4+— = — 
15 18 90 

Ex. 7-67. Three urns contain respectively I white, 2 black balls, 2 white and 1 black 
balls, 2 white and 2 black balls. One ball is transferred from the first urn into the second; 
then one from the latter is transferred into the third. Finally one ball is drawn from the third 
urn. What is the probability of its being white ? 

Sol. There are in all four different possibilities : 

(1) The white ball is transferred from the first to the second urn and then a white ball is 
transferred from the second to the third urn. 

(2) The white ball is transferred from the first to the second urn and then the black ball 
is transferred from the second to the third urn. 

(3) A black ball is transferred from the first to the second urn and then a white ball is 
transferred from the second to the third urn. 

(4) A black ball is transferred from the first to the second urn and then a black ball is 
transferred from the second to the third urn. 

(1) Probability of drawing the white ball from the first urn 
l 
3 

After transferring the white ball from the first to the second urn, 

Number of white balls in the second urn = 3 
and total number of balls in the second um =4 


.. Probability of drawing a white ball from the second urn after transferring the white 
ball from the first to the second urn 


3 
4 
After transferring a white ball from the second to the third urn, 
Number of white ball in the third urn = 3 
and total number of balls in the third un =5 
.. The probability of drawing a white ball from the third urn after transferring a white 
ball from the second to the third urn 


- .». By the theorem of compound probability, the probability of transferring the white 
ball from the first to the second urn; then a white ball from the second to the third and then 
drawing a white ball from the third urn 


(2) After transferring white ball from the first to the second urn, the number of black 
balls in the second urn = 1. 

.. The probability of drawing the black ball from the second urn after transferring the 
white ball from the first to the second urn. 
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oa 
id 
After transferring the black ball from the second to the third urn, 
the number of white balls in the third urn = 2 


.. Probability of drawing a white ball from the third um after transferring the black ball _ 


fein the second to the third urn 
a2 
= 


.. The probability of transferring the white ball from the first to the second; then the 
black ball from the second to the third and then drawing a white ball from the third um 


(3) Proceeding as in above two cases, the probability of transferring a black ball from 
the first to the second urn; then a white ball from the second to the third urn and then drawing 
a white ball from the third urn 


(4) The probability of transferring a black ball from the first to the second; a black ball 
from the second to the third and then drawing a white ball from the third urn 


Since the four possibilities are mutually exclusive, by the theorem of total probability, 
the probability of transferring a ball from the first to the second; then a ball from the second 
to the third and then drawing a white ball from the third un =~ 


9 2 12 8 31 
+— 


ss +>, 
60 60 60 60 60 


Ex. 7-68. In a bag there are six balls of which 3 are white and 3 are black. They are 


drawn successively without replacement. What is the chance that the colours are alternate ? 
Sol. Let (W) be the event that in a draw white ball appears and (B) be the event that 
black ball appears. The possible sequences are 
(W) (B) (W) (B) (W) (B) 


and (B) (W) (B) (W) (B) (MW) 
Probability of drawing a white (or black) ball in first draw. 


Rees 
6 2 
Probability of drawing a black (or white) ball in second draw when the ball drawn in 


first draw is white (or black) 
S) 
5 


Probability of drawing a white (or black) ball in third draw when in first two draws 
white (or black) and black (or white) balls have been drawn. - 


. 
“a 


PROBABILITY 


Probability of drawing a 
drawn is white (or black) 


Probability of drawing a 


and probability of drawing a1 
.. By compound probabi 


. By total probability 


Ex. 7-69. An urn contain. 
unnoted laid aside. Then anot 
Sol. There are two mutua 
(1) In the first draw a whi 
Prob. of drawing a white 


Number of white balls in 


Therefore, conditional prc 
draw a white ball has been dra 


Therefore, by compound ° 
draw 


(2) In the first draw a blac 
Prob. of drawing a black 1 


Number of white balls in: 


Therefore, conditional prc 
a black ball has been drawn 
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Probability of drawing a black (or white) ball in fourth draw when in third draw the ball 
drawn is white (or black) 


Probability of drawing a white (or black) ball in fifth draw 
1 


~ 9 
and probability of drawing a black (or white) ball in sixth draw = 1. 


.. By compound probability theorem, probability of either sequence 


-QQQEDe 


eat 
20 . 
.. By total probability theorem, probability of getting balls of alternate colours 
me 2 x os wae i 
20 10 


Ex. 7-69. An urn contains 3 white and 5 black balls. One ball is drawn and its colour 
unnoted laid aside. Then another ball is drawn. Find the probability that it is white. 

Sol. There are two mutually exclusive possibilities : 

(1) In the first draw a white ball appears. 

Prob. of drawing a white ball in first draw 


me) 
8 
Number of white balls in the urn before second draw 
=2. 
Therefore, conditional probability of drawing a white ball in second draw, when in first 
draw a white ball has been drawn 


aie 
7 


Therefore, by compound probability theorem, prob. of drawing a white ball in second 
draw 


(2) In the first draw a black ball appears. 
Prob. of drawing a black ball in first draw 


5 


"8 
Number of white balls in the urn before second draw 
=3 
Therefore, conditional prob. of drawing a white ball in second draw, when in first draw 
a black ball has been drawn 
3 
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Therefore, prob. of drawing a white ball in second draw 


_ 6 A 150.21 
56: .56:-56> 

Ex. 7-70. A lady declares that by taking a cup of tea with milk she can discriminate 
whether the milk or tea-infusion was first added to the cup. It is proposed to test this assertion 
by means of an experiment with 10 cups of tea, five made in one way and five in the other 
and presenting them to lady for judgement in random order. 

Calculate the probability on the null hypothesis (i.e., the lady has no discrimination 
power) that the lady would judge correctly all the ten cups, being known to her 5 are of each 
kind. 

Suppose that the tea cups were presented to the lady in five pairs, each pair to consist 
of cups of each kind in a random order. How would the probability of correctly judging with 
every cup on the null hypothesis be altered in this case ? 

Sol. (a) When tea cups are presented in random order : 

No. of ways of presenting 10 cups, 5 of each kind 


Reqd. prob. 


= 252. 
Out of these 252 ways, the cups are presented in any one manner. The lady has to find 
which one is that method. 
No. of favourable cases = 1. 


Reqd. prob. = 759 


(6) When cups are presented in 5 pairs : 
Two ways of presenting a pair are : 
MI; IM 
where ‘M7’ stands for the cup prepared by taking milk first and ‘/ for the cup prepared by 
taking infusion first. 
' When a pair is presented to the lady, she has to find, out of these two which one is the 
method used. 
“. Probability of correct judging for each pair 
1 
= 
As the presentation of various pairs is independent of each other, by the theorem of 
compound probability, the joint probability of correctly judging the 5 pairs 


: (4) al 
args: 32° 


Ex. 7-71. /n a group of equal number of men and women 10% men and 45% women 
are unemployed. What is the probability that a person selected at random is employed ? 
Sol. Probability for a man to be unemployed 


PROBABiL:TY 


and probability for a woman to 
.. Probability for a man to 
and probability for a woman to 


Since the group contains e 


1 
man = 5 and same is probabili 


.. Probability of selecting 
and pre vability of selecting an ¢ 


The selected person may 
probability, probability of selec 
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theorem of compound probabilit 
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nee tA 


and probability for a woman to be unemployed 


Sp es 
100 20 
”. Probability for a man to be employed 
yeaa 
10 10 
and probability for a woman to be employed 
apnoea 
20 20 


Since the group contains equal number of men and women, probability of selecting a 


1 Le 
man = 5 and same is probability of selected a woman. 


. Probability of selecting an employed man 


-(2}Ci) 


and pre vability of selecting an employed woman 


-(a}le0) 


The selected person may be either man or woman. Hence by the theorem of total 
probability, probability of selecting an employed person. 


-(3)Cis)+(a)ls) 


1 2 : at aioe 

~ 210 20f 40° 
Ex. 7-72. In a random sample of 1000 residents of a large city, 700 were found to be 
reading newspaper A and 400 were found to be reading newspaper B. On the hypothesis 
that the habits of reading newspapers A and B are independent of each other, (i) what is the 
chance that a person selected at random would be reading both the newspapers, (it) How 


many persons out of 1000 should be expected to be reading both newspapers ? 
Sol. Probability for a person to be reading newspaper A 


eg EE 
1000 10 — 
and probability for a person to be reading newspaper B 
i SON 
1000 5” 


Since the habits of reading newspapers A and B are independent of each other, by the 
theorem of compound probability, the probability that a person selected at random would be 
reading both the newspapers 
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.. No. of persons out of 1000 to be reading both the newspapers 


= ue x 1000 = 280. 
25 


Ex. 7-73. The probability of n independent eventS AVE Py, Pys Pas. P,- Find an ~ 


expression for probability that at least one of the events will Happ 
Sol. Since total prob. is unity, 


Prob. of non-happening of Ist event = 1-p, 


Prob. of non-happening of 2nd event = 1- p, 
, Prob. of non-happening of nth event = 1—- p, 


Since the events are independent, by compound prob. theorem, Prob. of non-happening 
of all the events 


= (1- py) = pp)... = py) 
.. Prob. of happening of at least one of the events 
= 1-(1- p,) (1—py).-1— Py) 
Ex. 7-74. A problem in statistics is given to students whose chances of solving it are 
il 1 
2°3 and ri respectively. What is the probability that the problem will be solved ? 
Sol. The problem will be solved if at least one student solves it. 


: Dt gel 
ere Pt 3° P2 3°P3 4 
1 1 1 
.. Read. . 1-}1-— |} 1-—])1-— 
ead prob ( >| Al i] 
_3 
4 


Ex. 7-75. Three groups of children contain 3 girls and | boy, 2 girls and 2 boys and 
3 boys and | girl. One child is selected at random from each group. Show that the chance 


13 
that the three selected consist of 1 girl and 2 boys is 32° 


3 
Sol. Probabilities of selecting a boy from three groups are rine and 4 respectively 


1 
and probabilities of selecting a girl from three groups are —,— and ri respectively. 


42 
Different mutually exclusive possibilities of required selection are : 
Ist group 2nd group 3rd group 
1 girl 1 boy’ 1 boy 
1 boy 1 girl 1 boy 
lboy | 1 boy 1 girl 
By theorem of compound probability, probabilities of these possibilities are 
3 13 9 11 3~—= 3 ie ee Gees 
Bae ee Sang eS 


424 32°42 4 32 424 32 
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Therefore, by the theoren 
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Sol. Total number of pers 


Total number of ways of 


Since out of 4 persons ch 
chosen out of 3 men and 2 wo 
Number of ways of select 


and number of ways of selecti 


. Total number of ways : 


Required Prob: 


Ex. 7-77. A and B are twe 


1 
a problem correctly are ri an 


same answer. If the probabilit 


answer was correct. 
Sol. Let (A,) and (A3). be 
answer was correct. 
Two students can get the s 
(1) Both of them get the c 
(2) Both of them get the v 
Therefore, by total probat 
P(A,) = P (both students 
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respectively. 7 
Therefore, by the theorem of total probability, probability of selecting 1 girl and 2 boys 
Pi OO doe 
3232; BD 132° 
Ex. 7-76. Four persons are chosen at random from a group containing 3 men, 2 women 


: 10 
and 4 children. Show that the chance that exactly two of them will be children is a 


Sol. Total number of persons in the group. 


=34+2+4=9 
Total number of ways of selecting 4 persons out of 9 

=9 

=, 


Since out of 4 persons chosen exactly two are to be children, remaining two are to be 
chosen out of 3 men and 2 women i.e., 5 persons. 
Number of ways of selecting two children out of 4 


and number of ways of selecting two persons out of 5 


— 5 
Cy 


.. Total number of ways of having exactly 2 children in the selection of 4 
> "Gy : ey | 


‘cy 5c, 10 
Required Probability = —9 ; OL 
4 


Ex. 7-77. A and B are two very weak students of statistics and their chances of solving 


1 1 
a problem correctly are 3 and 7 respectively. They are given a question and obtain the 


| 
same answer. If the probability of a common mistake is 1001” find the chance that their 


answer was correct. 
Sol. Let (A,) and (A,) be the events that two students get the same answer and their 


answer was correct. 
Two students can get the same answer in following two mutually exclusive ways : 
(1) Both of them get the correct answer. 
(2) Both of them get the wrong answer committing a common mistake. 
Therefore, by total probability theorem, © 


P(A,) = P (both students get the correct answer) 
+ P (both students get the wrong answer committing a common mistake) 


1 1 1 1) 1 
S41 fea te 
8 12 8 12) 1001 
1 77 
—+ 
96 96.1001 
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Also P(A|A>) 


By compound probability theorem, 
P(A,A,) = P(A,) P(A /A;) 


P(A\A)) 
P(A) 


Therefore, P(A, /A,) = 


_ 1001 _ 13 


. 1078 14 
Ex. 7-78. A speaks truth in 75% and B in 80% of the cases. In what percentages of 
cases are they likely to contradict each other in stating the same fact ? 
Sol. A and B can contradict each other in following mutually exclusive ways : 
(i) A speaks truth and B does not, 
(ii) B speaks truth and A does not. 


Therefore, by theorem of total probability, probability that A and B contradict each 
other 


Je Meee ee 
=e + —e 
45 45 
ae 
20 
‘’ Prob. of A speaking truth = a 7 = 
100 4 
Prob. of B speaking truth = na nee 
100 5 


7 
Therefore, A and B are likely fo contradict each other in 0 x 100 = 35% of the cases. 


Ex. 7-79. The probability that a teacher will give an unannounced test during any class 


1 
meeting is 5" If a student is absent twice, what is the probability that he will miss at teas. 


one test ? 


Sol. The student will not miss any test, if on the two days he is absent, the teacher does 
not give any test. 


Probability for a teacher not giving any test on the two days (when the student is absent) 


.. Probability thatthe student will not miss any test 


PROBABILITY 


Since total probability is unity, , 


Ex. 7-80. A can solve 75% of the 
that either A or B can solve a proble 
Sol. Let (A) and (B) be the even 


Then P(A) 
and P(B) 
Now P(A +B) 


Therefore, probability that eithe: 


Ex. 7-81. A husband and wife a 


post. The probability of the husband: 


What is the probability that only one 
Sol. There are only two MLE. po: 

(1) Husband is selected and wife 

(2) Wife is selected and husband 

_ Since there are two vacancies, hu: 


Reqd. prob. : 


Ex. 7-82. A and B throw altern 
before B throws 6 and B wins if he thre 
the events that A wins and B wins the; 
AS’ and Bs turn to throw the dice, shc 
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cases. In what percentages of 
‘ame fact ? 
ually exclusive ways : 


that A and B contradict each 


vi 
0 x 100 = 35% of the cases. 


rounced test during any class 


‘lity that he will miss at teas, 


1¢ is absent, the teacher does 


i (when the student is absent) 
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_ 16 
25 
Since total probability is unity, probability that the student will miss at least one test 
pe 
wo 5s 0S 


Ex. 7-80. A can solve 75% of the problems and B can solve 70%. What is the probability 
that either A or B can solve a problem chosen at random ? 
Sol. Let (4) and (B) be the events that 4 and B solve the problem respectively. 


Then P(A) = 


and P(B) = 

Now P(A +B) = P(A)+P(B)- P(AB) 

P(A) + P(B)- P(A)P(B) 
3 7 21 

Seed me 


40 
Therefore, probability that either 4 or B can solve the problem 
eels 
40° 
Ex. 7-81. 4 husband and wife appear in an interview for two vacancies in the same 


1 1 
post. The probability of the husband’: selection is 7 and that of the wife's selection is 3° 


What is the probability that only one of them will be selected ? 
Sol. There are only two M.E. possibilities ; 
(1) Husband is selected and wife is not selected, 
(2) Wife is selected and husband is not selected. 
_ Since there are two vacancies, husband’s selection and wife’s selection are independent. 


1 I 1 I 1\ 1 
Reqd. prob. = 7 ene “F) 5 


eg O10 
35 3535 
_2 
= 


Ex. 7-82. 4 and B throw alternately a pair of unbiased dice. A wins if he throws 7 
before B throws 6 and B wins if he throws 6 before A throws 7. IfA and B respectively denote 
the events that A wins and B wins the series, a and b respectively denote the events that it is 
Asx and B& turn to throw the dice, show that 
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; P(A/ od 5 Baie 
(i) (A/a) 576 
ii P(A/b aE BA Ta 
(i) (A/b) 36 
5 
(iii) P(B/a) = ernie 
5 31 
; = —+—P(B/ 
(iv) P(B/b) BG a6 (Bla) 


Hence or otherwise find P(A/a) and P(B/a). Also comment on the result that 
P(A/a)+P(B/a)=1. 
Sol. Now with a pair of dice, 


prob. or throwi ee 
prob. or owing 36 


Pe er meee. 
and prob. of throwing 36 6 
(i?) P(A/a) = Prob. that A wins if he has to throw first. 
This is possible in following two mutually exclusive ways : 
(a) A throws 7 in its first trial. 
1 
Its prob. is 6 
_(b)A does not throw 7 in its first throw. Then B will have the turn of throw and hence 


prob. of A’s winning is P(A/b). 


1 
Prob. for this possibility = ( -t| P(A/b) 


5 
—P(Alb 
; (A/b) 


a By total prob. theorem 


1 5 
P(A/a) = 6 + gr ale) 
(ii) P(A/b) = Prob. that A wins if B is to throw first. 
This is possible only when B does not throw 6 in its first throw. Then A has the turn to 
throw. So prob of A’s winning is P(A/a). 
. By compound prob. theorem 
5 
(i -2 P(A/a): 


31 
—P(Ala 
736 (A/a) 


il 


P(A/b) 


Similarly others can be proved. 


” 


PROBABILITY 


Ex.’7-83. If 4 whole numb. 


chance that the last digit in the 


Sol. In all there are ten digi 
Digits 1, 3, 7 and 9 have the 
digit in the product is one of the 
Therefore, if the last digit ir 
whole number must have its last 
Probability for whole numt 


Therefore, by compound prc 
have their last digit 1, 3, 7 or 9 


Therefore, reqd. prob. 


Ex. 7-84. A six faced die is, 
as an odd number when thrown. 
the two numbers thrown is even 

Sol. Let p be the probability 

Then 2p is the probability fc 

When the die is thrown, eith 


2p+ p=Total p 


Prob. for an odd num 
Prob. for an even num 


The sum of the two number: 
numbers or odd numbers. Now p 


and probability that in two throw 


. By the theorem of total ; 
thrown is even 
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rent on the result that 


ays : 


ave the turn of throw and hence 


irst throw. Then A has the turn to 
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Ex."7-83. If 4 whole numbers taken at random are multiplied together, show that the 


16 
chance that the last digit in the product is 1, 3, 7 or 9 is 625" 


Sol. In all there are ten digits viz., 0, 1, 2, 3, 4, 5, 6, 7, 8, 9. 

Digits 1, 3, 7 and 9 have the property that when any two of them are multiplied, the last 
digit in the product is one of these four digits. 

Therefore, if the last digit in the product of four whole numbers is to be 1, 3, 7 or 9 each 
whole number must have its last digit 1, 3, 7 or 9. 

Probability for whole number to have its last digit 1, 3, 7 or 9. 


4 2 


“10 5 
Therefore, by compound probability theorem probability that all the four whole numbers 
have their last digit 1, 3, 7 or 9 


Therefore, reqd. prob. = 655" 


Ex. 7-84. A six faced die is so biased that it is twice as likely to show an even number 
as an odd number when thrown. It is thrown twice. What is the probability that the sum of 
the two numbers thrown is even ? 

Sol. Let p be the probability for an odd number. 

Then 2p is the probability for an even number. 

When the die is thrown, either even number turns up or odd number turns up. 


2p + p= Total prob, = 1. 


ae 
EG 
] 
Prob. for an odd number = a 
2 
Prob. for an even number = 3" 


The sum of the two numbers thrown will be even if in both throws either we get even 
numbers or odd numbers. Now probability that in two throws even number turns up 


and probability that in two throws odd numbers turn up 


7 re 


*. By the theorem of total me probability that the sum of the two numbers 
Gown is even 
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4 1.5 
sh lane =—-—+-—=-—, 
. 9 9 9 
Ex. 7-85. A and B throw with a pair of dice. A wins if he throws 6 before B throws 7 and 


30 
B wins if he throws 7 before A throws 6. If A begins, show that his chance of winning is 61 


Sol. Let (A) and (B) be the events that A gets 6 and B gets 7 with a pair of dice respectively. 
5 6 1 
= —and P(B)=— =-. 

Then P(A) PS) (B) 36° 6 
1 5 


= 5 31 = 
Therefore P(A) = ioe ae EB Share 


Since 4 begins, he can win in following mutually exclusive ways : 
| (A), (ABA) (ABABA) t.eccscccsesseseeee 
Therefore, by theorem of total probability, probability that 4 wins 
= P(A) + P(ABA) + P(ABABA)-+... 
Since throws are independent, by compound probability theorem, probability that 4 
wins 


= P(A) + P(A)P(B)P(A) + P(A) P(B)P(A)P(B)P(A)+... 


5 31 5) (31 sy 
ee ae ee a 
36 36 6) \36 6 


os Ones 
~ 36°31 5 
36 6 
_ 30 
61 


<. Ex. 7-86. A and B take turns in throwing two dice, the first to throw 9 being awarded 
the prize. Show that their chances of winning are in the ratio 9 : 8. 
Sol, Let (A) and (B) be two events that A and B get 9 in a throw respectively. 
1 


4 
Th P(A) = P(B)=— =— 
en (A) = P(B) E18 


~ 2 1 8 
Therefore P(A) = cal ear ait 


, Since A begins, he can win in following mutually exclusive ways : 
(A), (ABA), (ABABA). 
Therefore, by theorem of total probability, probability that A wins 


PROBABILITY 


Since total probability is unit 


Therefore, chances of winnir 
Ex. 7-87. A, B and C in orde. 
their respective chances of winni. 
Sol. Let (A), (B) and (C) be t 


Then PC. 


P(, 
Since A begins, he can win ir 


(A), (A 
Therefore, by theorem of tote 


= P(A) + P(AB( 
= P(A)+P(A)P 


B can win in following mutua 


. Probability that B wins - 
= P(AB) + P(ABCAB) + P(z 
= P(A)P(B) + P(A)P(B)P(C 


.. Probability that C wins 
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hrows 6 before B throws 7 and 


his chance of winning is ai 


with a pair of dice respectively. 


ive ways : 
at A wins 


y theorem, probability that A 


rst to throw 9 being awarded 
9:8. 
. throw respectively. 


ve ways : 


at A wins 
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eee 


Therefore, chances of winning are in the ratio 9 : 8. 

Ex. 7-87. A, B and C in order toss a coin. The Jirst one to throw a head wins. What are 
their respective chances of winning assuming that the game may continue indefinitely ? 

Sol. Let (A), (B) and (C) be the. events that A, B and C get head in a toss respectively. 


Then P(A) = P(B)= P(C)= 3 
A) = P(B)= P(C)=1-1=1 
ee, rene ae 


Since A begins, he can win in following mutually exclusive ways : 


teecceees 


4 7 
ae 1 
ee es nes es ee 
ae 2 
: 1 iS ae oe Pe a 
| ee ee | lan Pan fe ae 
2 2 2 2; 7 


B can win in following mutually exclusive ways : 


sees 


(AB), (ABCAB), (ABCABCAB), 
.. Probability that B wins - 


= P(AB)+ P(ABCAB) + P(ABCA BCAB)t..... 
= P(A)P(B) + P(A)P(B)P(C)P(A)P(B) +{P(A)P(B)P(C)}? P(A)P(B) +... 


: (3) ; (4) (4) Ge 


.. Probability that C wins 
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Ex. 7-88. A and B toss a coin alternately on the understanding that the first to obtain 
1 


2 
head wins the toss. Show that their respective chances of winning are 3 and 3" 


Sol. Let (A) and (B) be the events that A and B get head in a toss respectively. 
1 
Then P(A) = P(B)= 5 


1 

2 

Since A begins, he can win in following mutually exclusive ways : 
(A), (ABA), (ABABA),.......+. 

Therefore, by theorem of total probability, probability that A wins 
= P(A) + P(ABA) + P(ABABA)+... 


= P(A) + P(A)P(B)P(A) + {P(A)P(B)}? P(A)+-. 


2 i 1 
P(A) = P(B)=1-5 = 


1 (: nea 
2 \2 2 
i 
ay ae 
it? 
4 
.. Probability that B wins 
ee ee 
3 

Ex. 7-89. A coin is tossed three times. Find the chance that head and tail will show 


alternately. 
Sol. Let (H) and (7) denote the occurrence of head and tail in a toss respectively. There 


are two possibilities : 
(H)(T)(H) 
and (T)(A)(T) 
Probability of either possibility 


Therefore, by total probability theorem probability of having head and tail alternately 
1 1 1 


8 8 4 
Ex. 7-90. A and B are two independent witnesses (i.e., there is no collusion between 
them) in a case. The probability that A will speak the truth is x and the probability that B 
will speak the truth is y. A and B agree in a certain statement. Show that the probability that 
this statement is true is 


xy. 
1-x-—yt2xy 


PROBABILITY 


Sol. Let (A,) and (A,) t 
statement is correct respectivel 


Then P 

and P(A 
By compound probability 1 
P(A 

P(A). 


Ex. 7-91. A coin is tossed 
consecutive heads is 


Sol. Let p,, denote the pre 
| Two mutually exclusive ways ¢ 
| (i) m consecutive heads o 
| (ii) Only (m—1) consecut 
| consecutive heads, head must ¢ 

Probability of possibility ( 
. and probability of possibility ( 

in last m trials). 


By theorem of total proba 


or 
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nding that the first to obtain 


2 
; —and—. 
ning are 3 3 


na toss respectively. 


ve ways : 


at A wins 


: that head and tail will show 


iil in a toss respectively. There 


aving head and tail alternately 


there is no collusion between 
is x and the probability that B 
. Show that the probability that 
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Sol. Let (A,) and (A,) be the events that A and B agree in a statement and their 


statement is correct respectively. 


Then P(A,) = x.y+(1—-x)-y) 
= |-~x-y+2xy 
and P(A,|A,) = xy 


By compound probability theorem, 
P(A,A) = P(A,)P(A/A,) 


P(A\A,) 
P(A,/A,) = 


P(A) 
pene) Sere 
~ l-x~-y+t2xy 

Ex. 7-91. A coin is tossed (m+n) times (m>n). Show that the probability of getting m 
consecutive heads is 


n+2 
qmtl ‘ 


Sol. Let p,, denote the probability of getting m consecutive heads in (m+) tossings. 
Two mutually exclusive ways of having m consecutive heads in (m+n) tossings are : 
(i) m consecutive heads occur in (m+n-—1) tossings. 
(ii) Only (m-1) consecutive heads occur in (m+n -1) tossings. In order to have m 
consecutive heads, head must appear in (m-+n)th toss and there must be a tail in mth toss. 
Probability of possibility (i) = p,_; 
and probability of possibility (ii) = (Probability of tail in nth trial) x (Probability of heads 


in last m trials). ; 
1/f1it ; 
ees oe m times 
2082-2 


By theorem of total probability, 


Py = Pn-i* Smet 


i 
or gimtl = Pn Pa-i = Pa-i ~ Pn-2 == Pi ~ Po 
n 
Py = Po m ginel 
Po = probability of getting m consecutive heads in (m + 0) 


tossings 
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ee A TIS 


Ph = yn pmai 


n+2 


gintl 


Ex. 7-92. (a) What is the probability that two numbers chosen at random will be prime 
to each other ? 


(b) Four positive integers are chosen at random. Find the probability of their having 
a common factor. 


Sol. (a) Let ‘a’ and ‘b’ be any two numbers and ‘r’ a prime number. When ‘a’ is divided 
by ‘r’ the possible remainders are 


0, 1, 2,....(7-1) 
Therefore, probability that ‘a’ is divisible by ‘7’ 


1 
i.e., probability of getting zero remainder = = 


1 
Similarly probability that ‘b’ is divisible by ‘7’ = : 
By compound probability theorem, 


I] 
probability that ‘r’ divides both ‘a’ and ‘b’ = = 
. r 


Therefore, probability that ‘a’ and ‘b’ don’t have a common factor ‘7’ 


(-3 


Therefore, probability that,‘a’ and ‘b’ are prime to each other 


= I('-3} ree 8 Sy 


r 


20 
a 


(b) Let a, b, c and d be four integers and r a prime number. 
Then prob. that all the four numbers a, b, c, dare divisible byr 


I 
~ os 
; 


Therefore prob. that a, b, c, d do not have r as a common factor 


(3 


Therefore, prob. that four integers do not have any common factor 


1 1 
-{] acy r=2,3,5, 7.0... 


PROBABILITY 


Therefore, prob. that four 


Ex. 7-93. Cards are dealt 
Show that the probability that 


If cards continue to be dea 
exactly r cards are dealt in all 


Sol. Since n cards are to b 
from the remaining 48 cards le 
.. Probability for drawing 


Evidently (n+ 1)th card n 
(52-n) cards. 
Probability for having (n- 


By the theorem of compot 
first ace 


Out of r cards dealt before 
(n+1)th draw. 
_Number of ways of dealir 


. Probability of drawing 


Evidently (r+ 1)th cardr 
(52-—r) cards. 
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osen at random will be prime 
1e probability of their having 


‘number. When ‘a’ is divided 


on factor ‘r’ 


ther 


‘actor 


n factor 
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90 


n 


Therefore, prob. that four integers have a common factor 


90 
= 1-—. 
n4 


Ex. 7-93. Cards are dealt one by one from a well shuffled pack until an ace appears. 
Show that the probability that exactly n cards are dealt before the first ace, is 
(S1-n)(S0—n)(49 —n) 
13.51.50.49 
If cards continue to be dealt until a second ace appears, prove that the probability that 
exactly r cards are dealt in all before the second ace, is 
r(S5l—r)(S0-r) 
1317.50.49 © 
Sol. Since n cards are to be dealt before first ace appears, these n cards are to be dealt 


from the remaining 48 cards leaving aside 4 aces. 
.. Probability for drawing n cards (not containing any ace) from a pack 


4 
ee 


Evidently (7 +1)th card must be an ace to be drawn out of 4 aces present in remaining 
(52-n) cards. 
Probability for having (n +1)th card an ace 


By the theorem of compound probability, probability of dealing exactly n cards before 
first ace 


Cn a 


ae 52-n C 


_ Gl-njS0-1n)(49—n) 
7 13.51.50.49 
Out of r cards dealt before a second ace appears, one card is an ace which was drawn in 
(n+1)th draw. 


_Number of ways of dealing r cards containing one ace 
_4 
sd 


.. Probability of drawing r cards containing one ace 


Evidently (r +1)th card must be an ace to be drawn out of 3 aces present in remaining 
(52 —r) cards. 
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Probability of having (r +1)th card a second ace 


3 
Cc} 


52-r ro 


' Therefore, by compound probability theorem probability of dealing exactly r cards 
before second ace : 


= 52 "52=r 
ce C} 


_ rl-r)(50-r) 
~ 1317.50.49 
Ex. 7-94, Cards are dealt one-by-one from an ordinary pack (without replacements) 
until two aces have appeared. Find the most probable number of cards to be turned up. 
Sol. Let x be the total number of cards dealt until the second ace appears and P(x) be 
its probability. 
Evidently xth card must be an ace and among remaining (x—1) cards there must be one 
ace. 
Number of ways of drawing (x—1) cards including one ace 


_ 4 48 
~ Cy" Cy_2 


. Prob. of drawing (x-1) cards including one ace 


4 
Cy Cy-2 


= 52 
Cy-1 


Since xth card, which is to be an ace, is to be drawn from 52-(x-1) =53-.x cards 
containing 3 aces, prob. that xth card is an ace 


ox) 
53-x 
Therefore, by compound probability theorem, 
4¢, Bo 3 
P(x) = 32 c, 53-x 


_ &-)G2-x)S1-~x) 
: 13175049 
Most probable number of cards is that value of x for which 
P(x-1) < P(x) > P(x+1) 


Consider P(x-1) < P(x) 
‘ (x —2)(53 - x)(52- x) . (x—1)(52—x)(51=x) 
ace 1317.50.49 1317.50.49 
Pe) 
Le., 3 

Consider P(x) > P(x+1) 


PROBABILITY 

; (x-DI 

ie. — 
; 1 

ié., 


Therefore, most probabl 


Since x is to be an integ¢ 


. Most probable numbe 


Ex. 7-95. Of three indep 
the prob. that the second onl 


l 
happen is 7 Obtain the ui 


Sol. Let A,, A, and A, 


Now P(A 
ie., P(A, )P(A- 
ie, p\(- p2) 

PU: 
ie., (1- p, )(p2) 
and . PC 
i.e, (l- p,)(l 


From (1), (2) and (3) 


Py 


Ex. 7-96. An integer is 
the prob. that the integers ¢ 
Sol. Let A and B be the: 


Since 6x 33 =198 anc 
divisible by 6 and 8 respec! 
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of dealing exactly 7 cards 


ack (without replacements) 
of cards to be turned up. 


id ace appears and P(x) be 


-1) cards there must be one 


52-(x-1) =53-x cards 


i1~x) 
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(x -1)(52 — x)(51- x) . x(51— x)(50— x) 
13.17.50.49 13.17.50.49 


: 52 
Lé., x>— 


3 . 
Therefore, most probable number x of cards is such that 


Lé., 


Since x is to be an integer, 
x=18 
.. Most probable number of cards dealt 
= 18. 


1 
Ex. 7-95. Of three independent events the prob. that the first only should happen is a 
1 
the prob. that the second only should happen is 3 and the prob. that the third only should 


1 
happen is D Obtain the unconditional probabilities of the three events. 


Sol. Let A,, A, and A, be the events and p,, p2, p3 be their unconditional probabilities. 


Ale 


Now P(A, A,A3) = 


ie, P(A,)P(A,)P(A3) = 


(A, B,C are independent) 


1 
iLeé., p\(- p2)U- p3) = Fi . (1) 
Se fas 1 
P(A,A,A3) = 8 
1 
L.e., (1— p,)(p2)U - p3) = 8 ..(2) 
foes 1 
and P(A,A,A3) = DD 
1 
be:, (-p)U- Pa)P3 = 75 .-.3) 
From (1), (2) and (3) 
ae aoe ait ae 
Pr = 59 P2- 3 P3~ 4 


Ex. 7-96. An integer is chosen at random from the first two hundred integers, what is 
the prob. that the integers chosen is divisible by 6 or 8 ? 

Sol. Let A and B be the events that the number chosen is divisible by 6 and 8 respectively. 

Since 6x 33 =198 and 8x25 = 200, there are 33 and 25 numbers upto 200 which are 
divisible by 6 and 8 respectively. 
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33 25 
”. P(A)=——, P(B)=—.. 
50" 200 


Now L.C.M. of 6 and 8 = 24. 
The number which is divisible by 24 is divisible by 6 and 8 both. 
.. The number of numbers which are divisible by 6 and 8 both 


200 
= greatest integer less than 


“24 
= 8. 
8 
P(AB) = 200 
Now prob. that the integer chosen is divisible by 6 or 8 
= P(A+B) 
= P(A)+ P(B)— P(AB) 
3325 8 
~ 200" 200 200 
50.1 
=a 


Ex. 7-97. [fn integers taken at random are multiplied together, show that (a) the chance 
n 


2 
that the last digit of the product is 1, 3, 7 or 9 is (2) > (b) the chance of its being 2, 4, 6 or 


_ 4" -2" : _ 5” -4" : : 10” —8” —5" 44" 
8 is 5 +(c) ofits being 5 is 0" ; and (d) ofits being ‘0’ is SS ger : 


Sol. (a) The last digit in the product will be 1, 3, 7 or 9 if each of the n integers end with 
either of these four digits. Now since there are in all 10 digits, prob. of selecting these four 
digits 

meee 
~ 10 
This is also the prob. for a integer to end with either 1,3, 7 or 9. 


. By compound prob. theorem, prob. that the last digit in the product of n integers is 
1, 3, 7 or 9 = prob. that all m integers end with 1,3, 7 o0r9 


-(3)-6) 


(6) The last digit in the product will be 1, 2, 3, 4, 6, 7, 8 or 9 iff each of the n integers 
end with either of these eight digits. 
.. Prob. for an integer to end with either of these 8 digits. 


d AY 
. Prob. for the last digit in the product of n integers to be 1, 2, 3, 4, 6, 7, 8 or 9= (=) . 


PROBABILITY 


Now the prob. for the last 


{from (a)}. 
.. By total prob. theorem, 
4" = 2" 
2, 4, 6, or 8 = on 


(c) The last digit in the p1 
with 1, 3, 7 and 9. 
Now prob. that integers 


.. Prob. that out of 7 integ 


n 
= (=) — prob. that all intege 
(d) Since total prob. is un 
= 1-(prob. that last digit ir 
= 1-(prob. that last digit ir 
~(prob. that last digit in th 
—(prob. that last digit in th 


Ex. 7-98. n letters to each ¢ 
at random. (i) What is the pro 
What is the probability that exi 


Sol. Let U,, be the numbe 
There are two mutually ex 
(1) If any two letters occur 
wrong in U,,_, ways. Since 01 
which all the letters can go wrc 
(2) If one letter occupies : 
(n-1) ways, the remaining (n 


Therefore, number of way: 


or U,,- 


n 


Change 2 to n—1,n-2,... 
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', Show that (a) the chance 
ance of its being 2, 4, 6 or 


10” —8" -5" +4" 
10" 

. of the 7 integers end with 

ob. of selecting these four 


‘0’ is 


x9, 
te product of n integers is 


' iff each of the n integers 


4 n 
3,4,6,7,80r9=( 5] . 
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2 n 
Now the prob. for the last digit in the product of n integers to be 1, 3, 7 or 9 = (2} 


5 
{from (a)}. 
.. By total prob. theorem, prob. of having the last digit in the product of n integers to be 
4" 2” 
2, 4, 6, or 8 = aaa 
5 


(c) The last digit in the product will be 5 iff at least one integer end with 5 and others 
with 1, 3, 7 and 9. 
Now prob. that n integers end with 1, 3, 5, 7 or 9. 


5 n 

: (3) | 
.. Prob. that out of ” integers at least one integer ends with 5 and other with 1, 3, 7 or 9 
4)-=* 
10 10% - 
(d) Since total prob. is unity, prob. that the last digit in the product is ‘0’ 
= 1-(prob. that last digit in the product is 1, 2, 3, 4, 5, 6, 7, 8 or 9). 
= 1-(prob. that last digit in the product is 1, 3, 7, or 9 


~(prob. that last digit in the product is 2, 4, 6 or 8) 
~(prob. that last digit in the product is 5) 


ay [Bagh 57 4" 
Pope eeeneiae 
(is) = "10" 
1 
10" 
Ex. 7-98. n letters to each of which corresponds an envelope are placed in the envelopes 


at random. (i) What is the probability that no letter is placed in the right envelope ? (ii) 
What is the probability that exactly r letters are placed in the right envelope ? 


5 


—_ : n 
5 
= (3) — prob. that all integers end with 1, 3, 7 or 9 = (=) -( 


{10” —8" -5"4+4"}. 


Sol. Let U,, be the number of ways in which all the letters can go wrong. 
There are two mutually exclusive possibilities : 
(1) If any two letters occupy each other’s position, the remaining (n—2) letters can go 


wrong in U,,_, ways. Since one letter can go wrong in (n-1) ways, number of ways in 
which all the letters can go wrong = (n-1) U,_». 

(2) If one letter occupies another’s envelope and not vice-versa which can happen in 
(n—1) ways, the remaining (n-1) letters can go wrong in U,,_, ways. 

Therefore, number of ways in which all the letters can go wrong = (n—1)U,_,. 

U,, =(n-1) (U,_, +U,_2} 
or U,, —nU,_, =-(U,_, —(n-D) U,_>} 
Change n to n-1,n—2,.....3 
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aes —(n- )U,,-2 ae {U2 —(n-2) U,,_3} 
U,-2 —(#-2) U,_3 =-{U,_3 -(@-3) U,_4} 


U; -3U, = -{U, ~2U;} 
Multiplying 
U, -nU ,-4 =(-1)" {U, -2U)} 
U, = number of ways in which one letter out of one can go wrong 
and U, = number of ways in which two letters out of two can go wrong 
=1, 


Therefore, U, —nU,_, =(-1)""? =(-1" 


or Bie 2 Uni SGD 
n! (n-D)! n} 


Change n to n~-1,n—2,....,2 


Un-2 U,-3 ene 
(n-2)! (n-3)! (n-2)! 


PreeeeeUeT SUSI ETerer erect errr 


Uy U,_(-1) 

7) as ane 

_yyr-t _y" 

Ra ee ese wiotee pe ae 
n! 2! 3! (n-D! n! 


Total number of ways of distributing 7 letters in n envelopes = n! 


U 1 -1)" 
Therefore, reqd. prob. = — = ae —+..4 Sy 
ni 2! 3! n!} 


(ii) r letters can be chosen out of 7 in "c, ways and rest can go wrong in U,,_, ways. 


n 
U,-+ 
Therefore, reqd. prob. = tet 
n} 


1/1 1 (-b"" 
= —4—-T +t 
r!}2! 3! (n-ry)! 


PROBABILITY 


Ex. 7-99. A player tosses 
two for every tail. He is to pla 
attaining exactly n, show tha 


and hence find the value of | 
Sol. Two mutually exclu 
(1) when score is (n-1) 
(2) when score is (n- 2° 


The probabilities of these 


1 
and 5 Pr-2 
Therefore, by total prob 


or 
or (Py 
or (Pp 
Cha 
(Pn. 
(Pn. 
Multiplying p 


Score ‘2’ can be attaine 
(1) tossing tail in the fu 
(2) tossing heads in firs 


Therefore, 
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9 wrong 


230 wrong 


=n! 


n go wrong in U,,_, ways. 
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Ex. 7-99. A player tosses a coin and is to.score one point for every head turned up and 


- two for every tail. He is to play on until his score reaches or passes n. If p,, is the chance for 


attaining exactly n, show that 
1 
Pn = 2 (Prt + Pn-2) 


and hence find the value of p,. - 
Sol. Two mutually exclusive possibilities of attaining score exactly n are : 
(1) when score is (1-1), player tosses head. 
(2) when score is (n—2), player tosses tail. 


l 
The probabilities of these two possibilities, by compound probability theorem, are 5 Pn-t 


! 
and 4 Pn-2- 
Therefore, by total probability theorem, 
l l 
Pn = 3 Pr-k +5 Pn-2 


1 
3 (Pn + Py-2) 


or 2Pn = Pn-t + Py-2 
or (Pp =P) = —(Pa-1 — Pn-2) 
| 
or (Px — Pn) = ~) (Pa-1 — Pn-2) 
Changing n ton-1,n—-2,...3 
i 
(Ph-i x Pn-2) = (-4} (Pr-2 - Pn-3) 


AA) 
(Pn-2 — Pn-3) = (-5] (Pr-3 —Pn-a) 


1 
(P3 - Pr) -(-] (P2 - Pi) 


1 n-2 
Multiplying Pn-Pn-1 = (-7] (P2 - Pi) 


Score ‘2’ can be attained in following two mutually exclusive ways : 
(1) tossing tail in the first trial. 
(2) tossing heads in first two trials. 


Therefore, Py = 
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1 
' Also p) = 5 (probability of tossing head in first trial.) 


2 
Adding above equations (1) 


1 n 
Therefore, Pn ~ Pn-) = (-+) 


Pn P2 
‘ 3 
r Pi reslers 
or . Fi 
2Pn + Py = 

Adding (2) and (3) 
3p, 
Therefore, Pr 


Ex. 7-100. Each of the n urns contains a white and b black balls. One ball is transferred 
from the first urn into the second, then one ball from the latter into the third and so on. 
Finally one ball is taken from the last urn, what is the probability of its being white ? 


Sol. Let p, be the probability of drawing a white ball from the Ath urn. 


There are two possibilities : 


(1) A white ball is transferred from (k -1)th urn to Ath urn. 
(2) A black ball is transferred from (k —1)th urn to Ath urn. 


! ee 
#{)2+c0 ral 


In (1) number of white balls in Ath um = a +l. 


Therefore, conditional probability of drawing a white ball from Ath urn when a white 
ball is transferred from (k —1)th um to Ath um = 


Probability of drawing a white ball from (k -1)th urn = Pr-1- 
Therefore, by compound probability theorem, probability of drawing a white ball from 


Ath urn. (If possibility (1) happens) 


(If possibility (2) happens) = 


at 


a 


a+l 


a+i 


=—__ Pr-i: 
a+b+1 
Similarly probability of drawing a white ball from Ath urn. 


i 
mak Pr) 
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at+b+1 
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Therefore, by theorem of total 
1 1 

Ta Pmi ty Pi 

Put A 

P; 


But p, = probability of drawi 


Therefore, P: 


Similarly, p 
In general, 


P, 


Ex. 7-101. Jn a lottery m-tickei 
and returned before the next drawi 
each of the numbers 1, 2....n will a, 


Py =1-" a{1- 


Sol. Let (A,), (Ay) yeoeessees (A, 


appear at least once in & drawing. 


numbers, 1, 2,.....2 respectively do 
By additive law, 


Probability of appearance of it 


Therefore, probability of non-: 
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Therefore, by theorem of total probability 


1 1 _ atl jen, soe a- ) 
| Ty Pni ty Pe Gabe aebel 
= ——— p,_, + 
a+b+1 a+b+1 
Put k =2 
Bi ia ges Seo 
(2) : abel at+b+1 
. But p, = probability of drawing a white ball from first urn. 
= a 
at+b 
Theref Pa 
Panes a atb+l ath atb+l 
mee: 
...(3) atb 
Similar! Ps = —— and 
imilarly, Toe and so on. 
In general, 
_ a 
Pn a+b 
Ex. 7-101. Jn a lottery m-tickets are drawn at a time out of the total number of n tickets 
Ils. One ball is transferred and returned before the next drawing is made. Show that the probability that in k drawings 
into the third and so on. each of the numbers 1, 2....n will appear at least once is 
y of its being white ? k k k 
the kth urn. 2 py =1-" a(1-2] <3 e{1-2} [i- ” ae 
| n n n-l 
Sol. Let (A,), (Aq),.....006 (A,,) denote the events that the numbers, 1, 2,....2 respectively 


appear at least once in k drawing. Then (A,), (A, ), sessees (A,,) denote the events that the 


| numbers, 1, 2,.....1 respectively do not appear in k drawings. 


om kth urn when a white By additive law, 


PA, HAgt ince: Ay) = >) P(A))— )) P(A), tee 
f= 


le : i i,j=l 


i<j 
drawin, hite ball fr 
Sees vane Probability of appearance of ith number in one draw 


m 


n 
Therefore, probability of non-appearance of ith number in one draw 


(4 
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Since tickets are replaced after each draw, draws are independent and hence by compound 
,probability theorem 


p(A,) = Probability of non-appearance of ith number in k drawings 
m\ 
7 (4 
n 
: mY 
Y P(A;)="e,/1-—] . 
TREO: (A;) aif ") 


i=l 


Probability of non-appearance of any two specified numbers in one draw 


Ne, 


(n—m)(n—m—]) 
n(n—-1) 


Therefore,  ; jal 
i<j 

Similarly, 
a m m Z m 

P(A,A;A,) ="e3| L-—| J 1- 1- 
p> (AiA jv) of my ( salt a 
i,j,k=l 
i<j<k 
and so on. 


Therefore, P(A, + A,+...+A,,) 


eft) (1-2 (1 ) 
Cc _——_— oo eae ~—s we cerce 
2 4 n n-} n-2 


Therefore, P(A, A3...A,) =1- P(A, +...+A,) 


PROBABILITY 


Which is the required pre 
Ex. 7-102. We have k vari 
objects. These objects are dra 


that the probability p, thatn 
all varieties is given by 

k""p, = (k _ 

Sol. In the first draw ther 

drawing (k-1L) varieties in (i 

probability of at least one out o 

Let (B,),(Bs),....(By_y) 

are absent respectively. Evide 

Therefore by additive lav 


PCB, + B+ ene eene’ +By_))= 


If (B,) happens, out of | 


drawn in first draw) are missing 
above) in a draw 


Therefore, P(B,) = Probz 
draws 


Similarly P(B,) = P(B3) 
If (B,B,) happens out oj 


drawn in first draw) are missi1 
Therefore as above, 


PG 
and so on. 
o PCB, + By... +B,_)) 


and so on. 
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ndent and hence by compound ; . : ( = ( i J (1 an ) 2a, 


n-2 


joe 


Fad po) 


EGrawIUES Which is the required probability. 


Ex. 7-102. We have k varieties of objects each variety consisting of the same number of 
objects. These objects are drawn one at a time and replaced before the next drawing. Show 


‘ that the probability p,, that n and no less drawings will be required to produce objects of 
all varieties is given by 
kp, = (kk 2 4 0 (K-38). 

yers in one draw Sol. In the first draw there will be necessarily one variety, therefore, the probability of 
drawing (k ~—1) varieties in (n-1) drawings is to be obtained. To proceed with firstly the 
probability of at least one out of (k — 1) varieties missing in (n—1) drawings will be obtained. 
Let (B,),(B2),.-..(B,_,) denote the events that first, second..... atk ~1)th varieties 

‘ are absent respectively. Evidently events (B,),(B,)...(B,_;) are non-mutually exclusive. 

Therefore by additive law, 


k-1 k-1 , 
PCB, + By +er.eee+By1) = P(B;)~ )) P(B)Bj) #00 
i=! 


: ij=l 
1and jth numbers in k drawings i<j 


If (B,) happens, out of k varieties two varieties (first variety and vari vhich was 
drawn in first draw) are missing. Probability of not drawing any of the two varie: mentioned 


above) ina draw 
ee, 
lk 


Therefore, P(B,) = Probability that two varieties (mentioned above) are absent in (1 ~ 1) 
draws 


(it) | (2 
K n-2 ‘ k 


k~2)" 
Similarly P(B)) = P(B3) =...= P(B,_) = (#7) 


a 


k k If (B,B,) happens out of k varieties three varieties (fizs', second and one which was 
7 "| (1 Pipe drawn in first draw) are missing.. 


7 a-l Therefore as above, 
k k : n-1 
n m f k-3 
—|j]1- an = P(B,B3) =...= (4?) 
“I =| Py) ne k 
and so on. 
n-l i n-} 
= k-2 - k-3 

m . m ie P(B, + B, seceses +B,_1) _ : Cy (4? al oo} Se 
3| l-— | | l- —— : k k 

n n-l 


and so on. 
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Also probability that in (n—1) draws the variety that has been drawn in first draw is 


l n-l 
absent = (:-2) 


1 n-1 
. Reqd. prob. = (1 - i} ~ P(B, +B, +...+By_1) 


k-1 n-| _ k-2 n-l : k-3 n-l 
wme(*) -Mal*P) +ha(AA] 


or kp, = (k-1)! =! 6 (k- 2)" 1 4*! 6 (k= 3)". 

Ex. 7-103. Two similar decks of n different cards are put into random order and matched 
against each other. If a card occupies the same position in both the decks we speak of match 
(coincidence). Find the probability of at least one match. 


Sol. Assume that cards of one deck be in natural order. Let (A;) be the event that a 


match occurs at the ith place. 
By additive law, 


n n 
P(A, +A2+...+A,) = > P(4,)- Si P(A,A t(D" P(A, A}...A,) 
i=l i<j=l 
If event (A;) happens, a match occurs at the ith place, i.e., in second deck ith numbered 
card is at the ith place while the remaining (n-1) cards may be in any order. 
Number of ways of distributing (1-1) cards on (n—-1) places 


= (n-1) 
and total of number ways of distributing n cards on n places 
=n! 
(n-l)! 1 
P(A;) = poe 
n!} n 
2x. PIAA (n-2)! 1 
| Similarly, jAj) = at AnD 
and so on. 
1 
P(A, + Ag t..eeceseeeee +A,)="¢)-—- 
(A, + Ap ii aaa nn) 
+ "Cc, Fo ens te att Zs 
~ n(n—-WD(i-2) nt! 
n-1 
He hee 
2! 3! n! 


Ex. 7-104. What is the probability that at least one of the players in a bridge game will 
get a complete suit of cards ? 

Sol. Let (A,),(A3),(A3) and (A,) denote the events that four players respectively get 
a complete suit of cards’ 


PROBABILITY 


P(A,) = Probability the 


as there are four suits and pl: 


Similarly P(A,) = P(A; 


P(A,A;)= Probability 


Similarly P(A 
and “ P(A\A 
P(A, +A, +A 
4 
4 
= cy bs "5 
Zo, 
iid 
= ig 
(52 
_ 16.1313 
Which is the required p: 
Ex. 7-105. Show that 
P(E 
Sol. By compound prot 
Since 
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been drawn in first draw is 


random order and matched 
‘he decks we speak of match 


et (A,;) be the event that a 


yt! P(Aj Ag. An) 


n second deck ith numbered 
> in any order. 
aces 


n(n-1). 


layers in a bridge game will 


our players respectively get 
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P(A,) = Probability that player gets a complete suit of cards 


as there are four suits and player is to get one. 


4 
Similarly P(A,) = P(A;) = P(A,) = z 
C13 


P(A,A;)= Probability that ith and jth player get complete suit of cards 


Cy Cj 
Fes ers 
i=1,2,3,4 
J=1,2,3,4 andi 4 j 
; 40 36, 2, 
Similarly PAA) = BiH, PRL DSA GIA Ak 
C13 C13 
‘ 
: Cc) +6) Cj 1 
and P(A,A,A3A4) = 28a a 
C13 C13 C13 C13 


P(A, +A, +A;+Ay) = yr )- SPA, A;) 


i<j=l 


4 
+ » P(A;A Ay ) -P(A;A2A3A4) 


i<j<k=l 
4 c c C; “c,.“e 
JAG L 4e, ber a a 1 ¢ I 
52 52 39 52 39 26 
13 C13 13 Cy3- C13 


: 2 
= 16,13!39! _ 6.4.3.3!) 26! + 4.43.2 4.432 3n4 24 C39 (13!)4 
| RS (52)! (52)! (52)! (52)! 


| _ 16.13!39!- 72(13!)726!4 72(13!)4 

7 52! 
Which is the required probability. 
Ex. 7-105. Show that 
| P(AB) S$ P(A) < P(A +B) S$ P(A) + P(B). 
Sol. By compound prob. theorem, 

P(AB) = P(A)P(B/ A) 
Since : P(B/ A) <1, P(AB) s P(A) .(1) 
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Since AB, AB, AB are mutually exclusive forms in which an event (A + B) can happen, 
by total prob. theorem, 
P(A+ B) = P(AB) + P(AB) + P(AB) 


Similarly P(A) = P(AB) + P(AB) 

P(A+ B) = P(A) + P(AB) 
Since P(AB) = 0, P(A) < P(A+B) (2) 
Also P(A+B) = P(A) + P(B)- P(AB) 
Since P(AB) 2 0, P(A+ B) < P(A) + P(B) (3) 
Result follows from (1), (2) and (3). 


Ex. 7-106. State and prove Bayes theorem. 
Sel. Statement. If an event E, can only occur in combination with one of the mutually 


exclusive events E,, E......... E,,, then 
P(E,)P(E/ E, 
CA ee k= 1, Quan 
; >) PCE) )P(E/ E;) 
i=l 
Proof. Since the events E can occur only with the events E,, Ey,......... E,,, the possible 


forms in which E. can occur are 


These forms are mutually exclusive as the events £ are mutually exclusive. 
.. By total prob. theorem, 


P(E) = P(EE,) + (EE) +....P(EE,) 


n n 
= > P(EE;)=)_ P(E; )P(E/E;) 
i=l isl : 
‘ (using compound prob. theorem) 
Now by compound prob. theorem : 


P(EE,) = P(E)P(E,,/ E) = P(E,)P(E/ E,) 


P(E, )P(E/E,) 


P(E, | E) = P(E) 


_~ P(E, )P(E/E,) 


n 
>) P(E; )P(E/ E;) 
i=l : 7 
Note. The probabilities P(E,) and P(E, /E) are known as ‘priori’ and ‘posteriori’ 


probabilities. 
Thus, Baye’s theorem can be stated as : 
‘If an event E can occur only in combination with the mutually exclusive events 


E,, E,,...E, .and if 


san ga SE atts 


PROBABILITY 


(i) the priori probabilities 
of knowledge regarding the oc 
Pd 
are given, the posteriori } 
P(E, / E), P(E, / BE)... 

are given by 


PCE 


Ex. 7-107. A bridge play. 
the two of them. Each oppone 
three-two split on the hearts (. 


Sol. Let E,: the event th: 
E,: the event the 
then P(E, /E,) isto be obtai 


P(E; 


7 


and P(E\E2) = 


PCE. 


Ex. 7-108. Urn A contaii 
and two black balls. One ball 
turns out to be white. What is 


Sol. Let £,: event that tr 
E,: event that t 
E;: a white bal. 
P(E, / E3) is to be obtai 
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an event (A+B) can happen, 


...(2) 


(B) ...(3) 


tion with one of the mutually 


itually exclusive. 


P(E/E;) 
g compound prob. theorem) 


)P(E/ E;,) 


as ‘priori’ and ‘posteriori’ 


mutually exclusive events 
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(i) the priori probabilities P(E,), P(E2),....P(E,,) corresponding to the total absence 
of knowledge regarding the occurrence of E and (ii) the conditional probabilities 
P(E/E,), P(E/E,)........ »P(E/E,) 
are given, the posteriori probabilities 
P(E, / E), P(Ey | E) ese, P(E, 1 E) 
are given by 


P(E,)P(E/ E 
P(E, /E) = Sep RE ED. k= 1,20 


> >) P(E) )P(E/ E;) 


i=] 


Ex. 7-107. A bridge player knows that his opponents have exactly five hearts between 
the two of them. Each opponent has thirteen cards. What is the probability that there is a 
three-two split on the hearts (i.e., one player has three hearts and the other two) ? 


Sol. Let £,: the event that two opponents have exactly five hearts between them. 
E,: the event that one player has three hearts and the other two. 
then P(E, /E,) is to be obtained. 


(E, E>) 
P(E,/E,) = P(E,) 

( cs %cm1) 

Now P(E,) = 357 
26 


and P(E, E,) = 52 39 


52 39 
Cy3- C43 
13 29 
2+ "C3. Cy. 2+ Cy.” C6 
P(E,/E,) = 52 39 13.39 
13° 13+ C5 + C2) 
78° 
115° 


Ex. 7-108. Urn A contains two white and two black balls; Urn B contains three white 
and two black balls. One ball is transferred from A to B; one ball is then drawn from B and 
turns out to be white. What is the probability that the transferred ball was white ? 


Sol. Let £,: event that transferred ball was white 
E,: event that transferred ball was black 
E,: a white ball is drawn from B 


P(E, / E;) is to be obtained. 
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ee eo 21 
P(E,)=—=— , P(E,)=—=— 
Now (E)) ae (E2) Fie 
4 2 3 1 
P(E,/E,)=>=—, P(E,/E,)=—=— 
MAVENS Greg tes AE 
By Bay’s theorem, 
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P(E,/E,) P(E,) 


P(E, /E,) = P(E3/ E,)P(E,)+ P(E; / E,)P(E>) 
2,1 
Zi id - ¢ 
i ie ae aa 


Ex. 109. Die A has four red and two blue faces and die B has two red and four blue 
faces. The following game is played : First a coin is tossed once. If it ‘falls heads, the game 
continues by repeatedly throwing die A; if it falls tails die B is repeatedly tossed. 


1 
(a) Show that the probability of red in any throw is 3° 


(5) If the first two throws of the die resulted in red, what is the probability of red at the 


third throw ? 


(c) If the red turns up at the first n throws, what is the probability that die A is being 


used ? 
Sol. (a) At any throws, there are two possibilities : 
(i) Coin shows head and die A is used. 
it 
a 
(i2) Coin shows tail and die B is used. 


I 1 
z f =—xX—-=—, 
Here prob. of red 16% 


1 4 
H b. eo xXss 
ere pro of red 1 *% 


tf 11 
. Prob. of t hrow = >+7= 7 
rob. of red at any throw 3°62 


(b) Let’ E,: first two throws of the die resulted in red 
| mao E,: red at the third throw. . 
To find P(E,), 


2 
2 
if die A is used, prob. of red = (2} 


if die B is used, prob. of red = (“y 


- 4x(2) +4.(4 
Eo a) aN 


PROBABILITY 


To. find, P(E,E,) 


if die A is used, 


if die B is used, 


PC 


(c) Let E: red turns up 2 


Ex. 7-110. In a bolt faci 
per cent of the total. Out of 
drawn from the produce ar 
manufactured by A, B and C 


Sol. Let E be the event t 
is being produced by A, B, C 

Then P(E,) = 0-25, PC. 
and P(E/ E,)=0-02. 

It is required to find P(. 

By Baye’s theorem _ 


P(E\/E) = 5 
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) P(E; ) 
°(E;/E,)P(E2) 


ie B has two red and four blue 
once. [fit falls heads, the game 
‘is repeatedly tossed. 


tis the probability of red at the 


probability that die A is being 


To.find, P(E, E) 
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¢ 


3 
if die A is used, prob. of red = (2) 


NO 


3 


3 ‘ 
if die B is used, prob. of red = (3) 


(c) Let E: red turns up at the first n throws 


P(E) ig 
fon 
P(E/A) = (2} 
1 n 
P(E/B) = (;) 
P(E/A) P(A) 


P(AlE) = BCE] Ay P(A) + P(E/ B) E(B) 


2y 4 

3) 2 fl 
ay to (iN y. -142"" 
ia | ee 

cl ae aa he le 


Ex. 7-110. Jn a bolt factory machines A, B, C manufacture respectively 25, 35 and 40 
per cent of the total. Out of their output 5, 4 and 2 per cent are defective bolts. A bolt is 
drawn from the produce and is found defective. What are the probabilities that it was 
manufactured by A, B and C ? 


Sol. Let £ be the event that the bolt is defective and E,, E,, E; the events that the bolt 
is being produced by A, B, C respectively. 

Then P(E,) = 0-25, P(E,) = 0-35, P(E;) = 0-40, P(E/ E,) = 0-05, P(E/E,) = 0-04 
and P(E/ E,)=0-02. ; 

It is required to find P(E, / E), P(E, /E) and P(E,/E) 

By Baye’s theorem 


P(E, )P(E/E,) 
P(E, )P(E/ E,) + P(E,)P(E/ E,)+ P(E;)P(E/ E) 


5 (0-25) (0-05) 
~ (0-25) (0-05) + (0-35) (0-04) + (0-4) (0-02) 


and 


MATHEMATICAL STATISTICS 


12s 
345 


40 
345 


80 
345° 


Similarly, P(E, /E) = 


P(E, /E) = 
Ex. 7-111. Prove that 
P(A, + Ag tere A,) < P(A,)+ P(A2)...-+ P(A, ) 
Proof. We have 
P(A, + Az) = P(A,)+ P(A2)— P(A, A2) 
P(A, + Ay) < P(A,)+P(A2) 
P(A, + A, +A3) = P(A, +A, + Az) 
< P(A,)+ P(A2 + A3) 
< P(A,)+P(Az)+ P(A3) 
Let P(A, + Agte..tAm) < P(A,)+ P(Aq)+....+P(Am) 
Then P(A, +A,+ pecenee +Amst) a P(A, + A,+ eeaeeee +Anat) 
P(A,) + P(Ag + Ag tet Amat) 


lA 


< P(A,)+{P(Az) + P(A3)+...-- +P(A ns) } 


P(A,) + P(Ag +0000 +P(Amit) 
.. By induction result follows : 


(Bool’s inequality) | 


Ex. 7-112. For n events A,, Aq... A,, Show that P(A,A3...A,) 2 P(A,) + P(A2)+..- 
+P(A,)—-(n—1) 


Sol. For two events A, and A, we have 
P(A, +A,) = P(A,)+ P(A2)— P(A\A2) 
Also P(A, +A,)S1. 
P(A,)+ P(A,)— P(A\A2) $1 


> P(A,A,)2 P(A,) + P(A,)-! 
Result is true for n = 2. 


Let result is true for n =m 
Then 


P(AyAp.--Am) 2 P(A, t+ P(Ag) — (m= 1) 
Now P(A,Ag.«Am Amat) = PCAyAg+«+Ayy) Amat) 

. 2 P(A\Ag.--Am) + P(Amat)—* 
> {P(A,) + P(Ag)+--+P(Am) —(m—1)} +P(Amat)—! 


= P(A, )teveee+P(Ama,)—(m+1-1) 


(1) 


Hi) 


(by (1) 
(using 2) 
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.. If result is true for 2 =i 
.. Result is true for any pc 


Ex. 7-113. If AjAz,---A 


Sol. 


Ex. 7-114. Jn a toss of twe 
the first coin (ii) two heads giv 


Sol. Let A,, A, be the ev: 


Then | 
(i) P(A, Ay 


(ii) P(A,Ay / Ay 


Ex. 7-115. There are five 
‘i’ has ‘i’ defective balls and , 
(i) An urn is selected at re 
_ drawn is defective. 
(ii) A ball is drawn from on 
that it came from urn 5. 
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(Bool’s inequality) _ 


A>) 


eos t+Anat ) 
tect P(Amst)) 
P(Ans1) 


yeAn) 2 P(A,) + P(Ag) +... 


Ay) 
(A) 
-1) ...(2) 
)-t (by (1) 
(using 2) 
~(m+1-1) 
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.. If result is true for n = m then it is also true for n =m+1 
.. Result is true for any positive integral value of n. 


Ex. 7-113. /f A,Aq,.....A,,B are events and P(B)>0O show that 


P(A, + Ag terse A, 1B) > P(A; /B) 
i=] 
Sol. 
P{(A, +....4A, )B} 


P(A] +......4- +A, /B) = P(B) 


P(A,B+......+A,B) 
P(A;B) 
2, P(B) 


=) P(A; /B) 


Ex. 7-114. /n a toss of two coins, find the probability of (i) two heads given a head on 
the first coin (ii) two heads given at least one head. 


Sol. Let A,, A, be the events of head on first and second coin respectively. 


Then P(A) - PlA2) = 5 
| P(A\A,A,) _ P(A\A3) 

@) PAALTA) = "pay P(A) 
P(A, ) P(A) = P(A jae 
= PAY) ae 

P(A,A3A, U A) 

(ii) POIRIER E BAG oA 

P(A,A3) 


~ P(A.) + P(Az)= P(A, Ag) 


Ex. 7-115. There are five urns numbered from 1 to 5 and each containing 10 balls. Urn 
‘i’ has ‘i’ defective balls and 10-i non-defective balls (i=1, 2.....5): — 
(i) An urn is selected at random and a ball is drawn. Find the probability that the ball 


_ drawn is defective. 


(ii) A ball is drawn from one of the urns and it is found to be defective. Find the probability 
that it came from urn 5. 
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Sol. Let A = Event that a defective ball is selected. 
B, = Umm i is selected. 


1 
Then PES i=1, 2,.....5 


P(A/B,) = 7 , (as B,; contains i defective balls) 


5 
(i) P(A) = >, P(A/B;) P(B,) 
» dz 
ijGil_ 1 56 3 
Hye leg Se 10 
Ee 
P(A/ B.) P(Bs) “3 1 
(ii) P(Bs/A) = = IS = 


3 
SY P(A/B;) P(B,) 710 


i=] 


Ex. 7-116. An urn contains 3 black and 7 white balls. At each trial a ball is selected at 
random, its colour is noted and it is replaced along with two additional balls of the same 
colour. Find the probability that a black ball is selected in each of the first three trials. 


Sol. Let B; = Event that a black ball is selected in ith trial. 
Then P(B,) = o 
oO 


and given that B, has happened, no. of black balls before second trial = 3+2 =5 and total 
no. of balls = 12. 


PCB, 1 By) == 


and given that B,B, has happened, no. of black balls before 3rd trial = 5 + 2 = 7 and total 
no. of balls = 14. 


‘4 
P(B3 1 B\Ba) = 73 
P(B,B,B;) = P(B,B,) P(B3/B,B,) 

P(B,) P(B, / B,) P(B; / B,B2) 


3.5 7 1 


Ex. 7-117. An urn contains M balls numbered I to M. Where the first K balls are 
defective and the remaining M-K balls are non-defective, n balls are drawn from the urn 
one-by-one. Find the probability that the sample of n balls contains exactly k defective balls 
when sampling is (i) with replacement, (ii) without replacement. 

Sol. (i) With replacement 


PROBABILITY 
Let Z; = no. of the ball dra 


Then sample space S is 


and ( 
Let A k 


The event that | 
Then A, = Subset of S for 
the remaining (n-k) 2,5 are s.t 
Now in n-places, k-z,’s (1 
chosen in K* ways and remaini 


O 


P( 


(ii) Without replacement 


Here C 
and Oo 
P( 


Ex. 7-118. An urn contains. 
of size n is drawn. Find the prob 
contains k black balls. 

Sol. Two possibilities are th 

(i) sampling with replacem 

(ii) sampling without replac 


Let A, = Event that samp) 


B; = Event that jth bal 
(i) With replacement 


Here P( 


P(A, | 


PC 
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i defective balls) 
eee 
10 
31 
jo 51, 
10 2 


each trial a ball is selected at 
2 additional balls of the same 
ach of the first three trials. 


ial. 


ond trial= 3+2=5 and total 


3rd trial = 5 + 2 =7 and total 


) 
'; / B\B2) 


. Where the first K balls are 
balls are drawn from the urn 
ifains exactly k defective balls 
ent. 
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Let Z; = no. of the ball drawn in jth draw. 
Then sample space S is 
S={(z, 2, eeeee z,)} 


and O(S)=M" 
Let A, = The event that there are exactly & defective balls. 
Then A, = Subset of S for which exactly k of the z,s are numbers s.t 152; SK and 
the remaining (n—k) 2,5 are s.t K+1sz, SM. 
Now in n-places, k-z,’s (1 Sz; SK) can be arranged in "c, ways; k such z,’s can be 
chosen in K* ways and remaining (n—k) z;'S can be chosen in ( M- Ky""* ways. 
O(A,) = "ce, «K* -(M- Ky 


fre KE (M - Ky""* 


P(A,) = M" 
(ii) Without replacement 
Here O(S) = M(M -1)...(M~-n-1)=(M), 
and O(A,) = "cy (K)y (M-K)y-4 
"¢,.(K), (M-K),_-4 
P(A, ) = : (M), 2 
_ Fe, ¥Ke, 
= om 


Ex. 7-118. An urn contains M balls of which K are black and M-K are white. A sample 
of size n is drawn. Find the probability that the j" ball drawn is black given that the sample 
contains k black balls. 

Sol. Two possibilities are there : 

(4) sampling with replacement 

(ii) sampling without replacement 


Let A, = Event that sample contains exactly k black balls. 


B; = Event that jth ball drawn is black 
(i) With replacement 


n, K*(M-Ky"* ; 
Here P(A,) = x arr tee (See Exercise 117) 
k-1 = n~k 


K 
P(B;) = i (-.: balls are replaced) 


P(B;/A,) = 


(ii) Without replacement 


Here P(A,) 


P(A, / Bj) = 


j-l 
P(B;)= > PBC) a 
i=0 


MATHEMATICAL STATISTICS 


P(A, / Bj): P(B;) 


P(A,) 
ie er Ke) (M ry 2 a K 
M™! : M 
"c, K* (M-Ky"* 


-K 
K-1 Chek 
C, n~k 
k-1 Yo C5 
j-l 
YPC) PB; /C;) 
i=0 


where C; = event of exactly 7 black balls in the first (j-1) draws. 


P(C;) = 


P(B,1C;) = 


This is because at th draw, total no. 
balls are black. 


PIB;) =, 


Ko,. OT, 
L 
Cj 


K-i 
M-(j-1) 
of balls in the urn is M —(j—1) out of which K -i 


i=0 jr} 
K!  u-x 
| it(K -i)! re K=i 
a M! M-j+l 
(Gi-D)i(M-j+D! 
K =f Ky! eae 
MVi(K-1-)! “IC, 
K | era 
M-1! 
M i=0 Cj 
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PROBABILITY 


PL 


Pi 


1. Prove or disprove : 
(i) If P(A/B)2 Pi 


(ii) If P(A) = P(B) 

(iii) If P(A) =0, thi 

(iv) If P(A) = P(B) 

(v) If P(B/ A) =P 

2. If A,, A,B are events 
(i) P(A, /B,)= Pi 


(ii) P(A, /B)s P(e 
3. Show that 


4. Examine the consistenc 
5. A, Band Care three in 
P(A, 


Prove that P(A) == a 


Nl 
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-(j-1) out of which K -i 


PROBABILITY 


P(B;/ Ay) = 


x (Ga) _ x 
Me 


P(A, /B;) P(B;) 


P(A, ) 


{*e eo (K/.) 


P(B;/ Ag) = 


gore oie 
Mc 


EXERCISES 


. Prove or disprove : 


(2) If P(A/ B)2 P(A), then P(B/ A) 2 P(B) 


(ii) If P(A) = P(B) then A=B 
(iii) If P(A) =0, then P(AB)=0 


(iv) If P(A) = P(B)= p, then P(AB)< p* 


(v) If P(B/A)= P(B/A), then A and B are independent. 


. If A,;, A,B are events and P(B)>0 show that : 


(i) P(A, /B,) = P(A,A / B) + P(A,A,/B) 
(ii) P(A,/B)< P(A,/B), provided A, c Aj. 


. Show that 


P(A—B)= P(A)- P(ANB) 


. Examine the consistency of the following data 


i. I 1 
P(A) =—, P(B) =~, P(AB) =—. 
(A) 4 (B) 5 (AB) 3 


. A, Band C are three independent events such that 


aC ee, CR ee ae ee 
P(ABC) = —, P(ABC) =~, P(ABC) = — 
( ) 4 ( ) 8 ( ) 2 


i ieee oH 
Prove that P(A) = a and P(ABC) = re 
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x 


10 


11 


12. 


13. 


14. 


15. 
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. Define a random variable. A random variable x takes values 0, 1, 2,.... with probability 


prceportional to 


5* 


Find the prob. that. x <5. 


From a pack of 52 cards, 13 cards are drawn without replacement. Find the probability 
that there are exactly 6 spade cards. 


13 


. Ifthree squares are chosen at random on a chess board, show that the chance that they 


should be in a diagonal line is 7/744. 


. Three squares ofa chess board being chosen at random, what is the chance that two are 


of one colour and one of another ? 


A person writes 4 letters and 4 envelopes. If the letters are placed in the envelopes at 
random, what is the chance that not more than one letter is placed in the correct 


envelope ? 
Ans. a 
24 


Four right-foot shoes are paired at random with the corresponding set of the left-foot 


shoes. Find the prob. that no correct pair is obtained. 


mm! 


From a pack of 52 cards, three are drawn at random. Find the chance that these are a 
king, a queen and a knave. 
yee 
5525 


If two balls are drawn from a bag containing 2 white, 4 red and 5 black balls. What is 
the chance that (7) both the balls are red, (i) one is red and the other black ? 


Rie 
55 11 


Find the chance of throwing a sum of 9 ina single throw of two dice. 


Find the prob. of obtaining a total of 6 in a throw of 6 dice. 


PROBABILITY 


16. 


17, 


18. 


19. 


20. 


21 


. 


22. 


23. 


24. 


Ina single throw of three 
than 11, (iii) more than 1 


A and B throw with 3 d 
number ? 


Find the chance of throw 


Find the chance of throw 


A person throws two dic: 
the number on the lowes 
chance that the sum of th 


There are 10 tickets 5 of: 
1, 2, 3, 4, 5. What is the ¢ 
replaced at every trial, (ii 


Out of 20 consecutive m 
their sum is odd. 


A bag contains 50 ticket 


arranged in ascending on 


probability that x, = 30 ‘ 


Nine cards are drawn at r 
the numbers ‘1’, ‘0’ or ‘1 
drawn. Find the chance tt 
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ues 0, 1, 2,.... with probability 


acement. Find the probability 


3 | 39 
Ci 


how that the chance that they 


vhat is the chance that two are 


Ans. ie 
21 


re placed in the envelopes at 
tter is placed in the correct 


| Ans. Z| 
24 


*sponding set of the left-foot 


am 


d the chance that these are a 


Ange 
"5525 


:d and 5 black balls. What is 
id the other black ? 


hie 
55 11 


of two dice. 
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ee 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


24, 


Ina single throw of three dice, what is the chance of throwing (i) ‘four-five-six, (ii) less 
than 11, (iii) more than 10 ?” 
Ans. es he ul 
36 2 2 


A and B throw with 3 dice; if 4 throws 8, what is Bs chance of throwing a higher 


number ? 
Ans. fa 
; 27 


Find the chance of throwing 10 exactly in one throw with 3 dice. 


Find the chance of throwing (i) 18, (ii) 10 exactly in one throw of 4 dice. 


Arie 
81 81 


A person throws two dice, one the common cube and the other a regular tetrahedron, 
the number on the lowest face being taken in the case of the tetrahedron; what is the 
chance that the sum of the numbers thrown is not less than 5 ? 
3 
Ans. — 
4 


There are 10 tickets 5 of which are blanks and the others are marked with the numbers 
1, 2, 3, 4, 5. What is the prob. of drawing 10 in the three trials, (7) when the tickets are 
replaced at every trial, (ii) if the tickets are not replaced ? 
ee ge 
1000 60 


Out of 20 consecutive numbers two are chosen at random, find the probability that 


their sum is odd. 
Ans, i” 
19 


A bag contains 50 tickets numbered 1, 2,...50 of which 5 are drawn at random and 
arranged in ascending order of their numbers. (x, <x, < x3 <x4< x5). What.is the 


probability that x, = 30? 


Nine cards are drawn at random from a set of cards. Each card is marked with one of 
the numbers ‘1’, ‘0’ or ‘1’ and it is equally likely that any of the three numbers will be 
drawn. Find the chance that the sum of the numbers drawn is zerd. , 
ans. 2 
3° 
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25. 


26. 


27. 


28. 


29. 


. 30 


31. 


32. 


33. 


34. 


MATHEMATICAL STATISTICS 


A party of 21 persons take their seats at a round table. What are the odds in favour of 
two specified persons sitting together ? 
[Ans. 1 : 9] 


A number is chosen from each of two sets : 
1, 2, 3, 4, 5, 6, 7, 8, 9; 1, 2, 3, 4, 5, 6, 7, 8, 9. 


If P, denotes the probability that the sum of the numbers be 10 and P, the probability 


[Ans <] 
81 


A card is drawn from an ordinary pack and a gambler bets that it is a spade or an ace. 
What are the odds against his winning his bet ? 


that their sum be 8, find P, + P,. 


[Ans. 9 : 4] 


Find the probability that in a random arrangement of the letters of the word 
‘UNIVERSITY’ the two ‘I’s don’t come together. 

A party of ‘n’ men of whom ‘4’, ‘B’ are two form single rank. What is the chance that 
(i) A, B are next one another, (i/) exactly ‘m’ men are between them, (ii/) not more than 
‘m’ men are between them ? 


la 52; nama), wren 
n(n-1) 


“nn? n(n-1) 
If the letters of ‘ATTEMPT’ are written down at random, find the chance that (i) all the 
‘T’s are together, (ii) no two ‘T’s are together. 
Ans. a 2 
Loa 


Find the number of ways in which ‘p plus signs’ and ‘q minus signs’ may be placed in 
a row So that no two.minus signs are together. 

A letter is chosen at random out of ‘ASSININE’ and one is chosen at random out 
of ‘ASSASSIN’. Show that the chance that the same letter is chosen on both 


rhs Ses te ad 
occasions is 7- 
Six cards are drawn at random from a pack of 52 cards. What is the probability that 3 


will be red and 3 black ? 
gs 13000 
39151 


A bag contains 6 white and 9 black balls. The drawings of 4 balls are made such that 
(a) the balls are replaced before the second draw, (5) the balls are not replaced before 
the second draw. Find the probability that the first drawing will give 4 white and the 
second 4 black balls in each case. 


Re 
5915 715 
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35. 


36. 


37. 


38. 


39. 


40. 


41. 


42. 


43. 


44, 


45. 


Obtain the probability tha 
days of the week, assumin 


From a group of 25 persor 
(Assume a 365-day year a 
Cards are drawn one-by-o 
cards will precede the firs! 


(Hint: See 7-93) 


The odds against A solving 
the same problem are 7 to 
both try ? 


A hand of 13 cards is deal 
that the hand contains no 
contains 4 cards of one sui 


Two drawings each of 3 ba 
the balls being replaced be 
will give 3 white and the s 


A and B draw from a bag 
chances of first drawing a 


A is one of 6 horses entere 
C. It is 2: 1 that B rides A 
rides A his chance is triplec 
A person draws a card fro 
doing so until he draws a ‘c 
three trials, (ii) exactly thr 


Three urns contain respecti 
2 white and 3 black balls. C 
among the balls drawn the: 
A bag contains 17 counter: 
replaced; a second drawing 
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What are the odds in favour of 


[Ans. 1 : 9] 


s be 10 and P, the probability 


ets that it is a spade or an ace. 


[Ans.9 : 4] 
t of the letters of the word 


2 rank. What is the chance that 
‘ween them, (ii/) not more than 


-~m-1) (m+)l(n—-m-—2) 
‘n-1) , n(n~- 1) 
1, find the chance that (7) all the 


(era 
7 7 


minus signs’ may be placed in 


one is chosen at random out 
me letter is chosen on both 


. What is the probability that 3 


aie 13000 
39151 
s of 4 balls are made such that 


e balls are not replaced before 
wing will give 4 white and the 


oe: 
Ange 
5915 | 
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35. 


36. 


37. 


38. 


39. 


40. 


41. 


42. 


43. 


45. 


Obtain the probability that the birth-days of seven people will fall on seven different 
days of the week, assuming equal probability for the seven days. 
éi 


From a group of 25 persons, what is the prob. that all 25 will have different birthdays. 
(Assume a 365-day year and that all days are equally likely). 

Cards are drawn one-by-one from a full deck. What is the probability that exactly 10 
cards will precede the first ace. 


(Hint: See 7-93) 4165 
The odds against A solving a certain problem are 4 to 3 and odds in favour of B solving 
the same problem are 7 to 5S. What is the probability that the problem is solved if they 
both try ? 


i 


A hand of 13 cards is dealt out randomly from a full deck of 52 cards. Find the prob. 
that the hand contains no spade card. Also show that the probability that the hand 
contains 4 cards of one suit and 3 cards each of the other three suits is 


13 13 3 
4.3 64 ae 
Cj3- 


Two drawings each of 3 balls are made froma bag containing 5 white and 8 black balls, 
the balls being replaced before the second trial. Find the chance that the first drawing 
will give 3 white and the second 3 black balls. 
Ans. a 
20449 


A and B draw from a bag containing 3 white and 4 black balls. Find their respective 
chances of first drawing a white ball (the balls when drawn not being replaced). 


Ans. oils 2 
35 35 


A is one of 6 horses entered for a race and is to be ridden by one of two jockeys B and 
C. It is 2: 1 that B rides A in which case all the horses, are equally likely to win; if C 
tides A his chance is tripled. What are the odds against his winning ? [Ans. 13 : 5] 
A person draws a card from a pack, replaces it, and shuffles the pack. He continues 
doing so until he draws a ‘club’. What is the chance that he will have to make (i) at least 
three trials, (i?) exactly three trials ? 

Ans. @ ; 2| 

16 64 


. Three urns contain respectively 1 white and 2 black balls; 3 white and 1 black balls and 


2 white and 3 black balls. One ball is taken at random from each urn. Find the prob. that 
among the balls drawn there are 2 white and 1 black ball. 

A bag contains 17 counters marked with the numbers 1 to 17. A counter is drawn and 
replaced; a second drawing is then made, find the chance that the first number drawn is 
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46. 


47. 


48. 


49. 


50. 


51. 


52. 


53. 


54. 


even and the second odd. 


Three urns respectively contain 1 white and 3 black, 2 white and 4 black and 3 white 
and 1 black balls. A ball is drawn from an urn selected at random, find the chance of its 


being white. 
Ans 4| 
9 


1 
Criticise the statement : ‘the chance of throwing ace in the first trial is 6 and the 


: Segal ; Shane 
chance of.ace.in the second trial is — , therefore the chance of ace in two trials is >.’ 


Counters marked 1, 2, 3 are placed in a bag and one is withdrawn and replaced. The 
operation being repeated three times, what is the chance of obtaining a total of 6 ? 


| Ans Z| 
27 


A, B, Cin order cut a pack of cards, replacing them after each cut, on the condition that 
the first who cuts a spade shall win a prize. Find their respective chances. 


Ans. Ie, ae — 
37 37 37 
Six persons throw for a stake, which is to be won by the one who first throws head with 
a coin. If they throw in succession, find the chance of the fourth person. 


Ans. iD 
[ans-5| 


A and B play fora prize; A is to throw a die first and is to win if he throws 6. Ifhe fails, 
B is to throw and to win if he throws 6 or 5. If he fails, A is to throw again and to win 
with 6 or 5 or 4, and so on. Find the chance of each player. 

[Ans Paes B = 


324° 324 


A certain stake is to be won by the first person who throws an ace with an octahedral 
die. If there are 4 persons, what is the chance of the last ? 
Ans. eae 
1695 


A, B, C, D cut a pack of cards successively in the order mentioned. What are their 
respective chances of first cutting a spade ? 


64 48 36 27 
AMS. 3 3 te 
175 175 175 175 


Five persons\4, B, C.D, E throw a die in the order named until one of them throws an 
ace; find their respective chances of winning, supposing the throws to continue till an 


oe on (8). (8): (=) 
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55. 


56. 


57. 


58. 


59. 


60. 


61. 


62. 


63. 


64. 


How many throws with a ¢ 


getting ‘double six’ at least 


How many throws with a sin 


an ace at least once greater 


If the prob. of success be 0: 


least one success is greater | 


The odds against a certain 
independent of the former, a 
happen. 


The odds that a book will be 
4:3 and 3 : 4 respectively. ' 
will be favourable ? 


A throws two coins and B 
greater number of heads tha 


There are three works, one | 
volume. They are placed on 


same works are all together 


There are two bags, one of \ 
and 12 white balls. One ball 
change of drawing a red bal. 


Four students are selected at 
that the selected students inc 
least two boys. 

A bag contains 4 white, 5 re 
prob. that 

(a) No ball drawn is black. 
(b) Exactly two are black. 
(c) All are of the same colo 
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white and 4 black and 3 white 
it random, find the chance of its 


mf] 


1 
in the first trial is 6 and the 


ance of ace in two trials is —.’ 


s withdrawn and replaced. The 
2e of obtaining a total of 6 ? 


Ans. a 
27 


reach cut, on the condition that 
respective chances. 


hie eee 
37 37 37 


one who first throws head with 
the fourth person. 
Ans. 4 
63 


‘0 win if he throws 6. If he fails, 
A is to throw again and to win 


ayer. 
Ans. A ele go> 
324 324 


rows an ace with an octahedral 


it? 
Ans, usd 
1695 


der mentioned. What are their 


“1757 175° 175° 175 


1ed until one of them throws an 
ig the throws to continue till an 


F060 


ans 4, $8. 36, 2) 
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55. 


56. 


57. 


58. 


59. 


60. 


61. 


62. 


63. 


64. 


How many throws with a pair of dice are necessary in order to have the chance of 


l 
getting ‘double six’ at least once greater than a: 


[Ans. 25] 


How many throws with a single die are necessary in order to have the chance of getting 


lig 
an ace at least once greater than 3 z [Ans. 4] 


If the prob. of success be 0-01, how many trials are necessary in order that prob. of at 


1 
least one success is greater than 5 [Ans. 69] 


The odds against a certain event are 5 : 2 and the odds in favour of another event, 
independent of the former, are 6 : 5. Find the chance that one at least of the events will 


happen. 
Ans. 2s 
77 


The odds that a book will be favourably reviewed by three independent critics are 5 : 2, 
4:3 and 3 : 4 respectively. What is the probability that of the three reviews a majority 
will be favourable ? 


A throws two coins and B throws three coins. Find the chance that B will throw a 
greater number of heads than A. 
Ans. = 
2 


There are three works, one consisting of ‘3’ volumes, one of ‘4’ and the other of ‘1’ 
volume. They are placed on a self at random; prove that the chance that volumes of the 


3 
] is ——. 
same works are all together is 140 


There are two bags, one of which contains 5 red and 7 white balls and the other 3 red 
and 12 white balls. One ball is to be drawn from one or other of the two bags. Find the 
change of drawing a red ball. 


Four students are selected at random from 7 boys and 4 girls. Calculate the probabilities 
that the selected students include (i) two specified boys, (ii) exactly two boys, (iii) at 
least two boys. 

A bag contains 4 white, 5 red and 6 black balls. Three are drawn at random. Find the 
prob. that , 

(a) No ball drawn is black. 

(5) Exactly two are black. 

(c) All are of the same colour. 


266 


65. 


66. 


67. 


69 


70. 


71. 


72. 
73. 


74, 


75. 


76. 
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A has ‘3’ shares ina lottery in which there are ‘3’ prizes and ‘6’ blanks B has ‘1’ share 
in a lottery in which there is ‘1’ prize and ‘2’ blanks. Show that A’s chance of success 
is to B’s as 16:7. 

If p is the prob. that a man aged x years will die in a year. Find the prob. that out of ‘m’ 


men Aj, A,...A,, each aged x, A will die in a year and be the first to die. 


| Ans ca {l-(d- p| 
m 


It is 8 : 5 against a person who is now 40 years old living till he is 70 and 4 : 3 against 
a person now 50 living till he is 80. Find the prob. that at least one of these persons will 


59 
be alive 30 years hence. Ans, — 


. The prob. that a 50 years old man will be alive at 60 is 0-83 and the prob. that a 45 years 


old woman will be alive at 55 is 0-87. What is the prob. that a man who is 50 and his 
wife who is 45 will both be alive 10 year hence ? 

{Ans. 0-7221] 
Suppose that it is 9 : 7 against a person A who is now 35 years of age living till he is 65 
and 3 : 2 against a person B now 45 living till he is 75; find the chance that one at least 


of these persons will be alive 30 years hence. 


‘A number consists of 7 digits whose sum is 59; prove that the chance of its being 


4 
divisible by 1l is —. 


21 
If two coins are tossed 5 times, what is the chance that there will be 5 heads and 5 
; 63 
ils ? Ans. — 
tails ? | =| 


Find the chance of obtaining at least one six in a throw of four dice. 
Show that the chance of throwing at least one ace in a single throw with two dice 


i 
is 35° ; 
What is the probability of getting 9 cards of the same suit in one hand at a game of 
13, 39, 4 
(x) C4. Cy 
bridge 2 an races; a | 
‘ C13 


In three throws with a pair of dice, find the chance of throwing doublets at least once. 


One bag contains 3 white balls and 2 black balls, another contains 5 white and 3 black 
balls. If a bag is chosen at random and a ball is drawn from it, what is the chance that it 


7 49 
: ‘ Ans. — 
is white i 2 
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77. Four cards are drawn witho 


78. Find prob. in Ex. 40 if the t 


79. If the war breaks out on the 
a stretch there will be no w 


80. Three newspapers A, B, Ci 
that of the adult population 
B, 5% read both A and C, 4( 
read at least one of the pape 
both A and B ? 


81. There are three boxes cont 
3 red, 1 black balls; 3 whit 
it two balls are drawn at rar 
prob. that they come from : 


82. The probability that a persc 


2 
hit the same target is 3 ] 


person fires 5 shots. They f 
the target ? 
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5 and ‘6’ blanks B has ‘1’ share 
how that A’s chance of success 


r. Find the prob. that out of ‘m’ 
| be the first to die. 


| Ans es id= p| 
m 


ag till he is 70 and 4 : 3 against 
t least one of these persons will 


‘83 and the prob. that a 45 years 
). that a man who is 50 and his 


[Ans. 0-7221] 
i years of age living till he is 65 
‘ind the chance that one at least 


'e that the chance of its being 


at there will be 5 heads and 5 


Ans, pa 
256 
of four dice. 


a single throw with two dice 


suit in one hand at a game of 


13, 39, 4 
; Co C4. Cy 
[an = 


rowing doublets at least once. 


er contains 5 white and 3 black 
om it, what is the chance that it 
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77. 


78. 


79. 


80. 


81. 


82. 


Four cards are drawn without replacement. What is the prob. that these are all aces ? 


Ans. A 
270725 


Find prob. in Ex. 40 if the balls are not replaced before the second draw. 


Ans. mae 
| 


If the war breaks out on the average once in 25 years, find the prob. that in 50 years at 
a stretch there will be no war. , 


[Ans. e| 


Three newspapers A, B, C are published in a certain city. It is estimated from a survey 
that of the adult population : 20% read A, 16% read B, 14% read C, 8% read both A and 
B, 5% read both A and C, 40% read both B and C, 2% read all three (7) What percentage 
read at least one of the papers ? (ii) of those that read at least one, what percentage read 
both A and B? 

[Ans. 35%, 28%] 
There are three boxes containing respectively | white, 2 red, 3 black balls; 2 white, 
3 red, 1 black balls; 3 white, 1 red, 2 black balls. A box is chosen at random and from 
it two balls are drawn at random. The two balls are one red and one white. What is the 
prob. that they come from the (i) 1st box, (ii) 2nd box, (iii) 3rd box ? 


Mate 
11 11 11 


3 
The probability that a person can hit a target is 5 and the prob. that another person can 


2 
hit the same target is 3 But the first person can fire 4 shots in the time the second 


person fires 5 shots. They fire together. What is the prob. that the second person shoots 


the target ? 
Ans, 2. 
11 


Mathematical Expectation 


8.1. Stochastic Variate. The variate which can take certain values depending on chance 

is called chance variate or stochastic variate or random variate e. g., In rolling a die the 
‘variate corresponding to the number obtained is a stochastic variate. 

Variables are generlly denoted by capital letters (i.e., X, Y etc.) and corresponding 
small letters represents their values. In certain cases some small letters are used for both 
purposes. 

Probability Distribution. The dist obtained by taking the possible values of a chance 
variate together with their respective probabilities is called prob. dist. 


Expected value of a chance variate. 
Let x be the chance variate with prob dist. 


Then expected value of x is defined to be x,p,+x2p2+...+x,p, and is denoted by 


E(x). Thus, E(x) = x, p, +X. pot...4+X,Pp- 

Ex. 8-1. Let x denote the profit that a man makes in a business. He may earn 2,800 with 
probability 0:5; he may lose Rs. 5,500 with probability 0-3 and he may neither earn nor lose 
with probability 0:2. Calculate the mathematical expectation of x. 

Sol. The given prob. dist is 


x—>{-5500 0 2800 
p>\ 0:3 0-2 0-5 
-. E(x) =(-5500)(0-3) + (2800)(0-5) 


=—-1650 +1400 = —250. 
Ex. 8-2. Find the. expected value of the number of points that will be obtained in a 
single throw with an ordinary die. 
Sol. Lex x be the number of points obtained in a single dhirow with an ordinary dice. 
Then x can take values 1, 2, 3, 4, 5, 6. 
Also prob. of getting any number with a single die 


Therefore, expected value of x 


= cU+243444546) 


2 
6 
dl 
+ 
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Ex. 8-3. From a bag contair 
draw 2 coins indiscriminately. F 
Sol. Prob. of drawing 2 ‘20 


Prob. of drawing 1 ‘20 P’c 


Prob. of drawing 2 ‘25 P’ ci 


The person gets 40 P, 45 P: 


. Expectation of the perso: 

Ex. 8-4. A person draws 2 lt 

to receive 10 P for every white 
expectation. 


Sol. Three different possibi 
(7) The person draws 2 white 


(ii) The person draws 1 whi 
this happening 


(iii) The person draws 2 red 


.. Expectation 


tation 


1 values depending on chance 
riate e.g., In rolling a die the 
variate. 

> Y etc.) and corresponding 
nall letters are used for both 


2 possible values of a chance 
rob. dist. 


..+X,P, and is denoted by 


1ess. He may earn 2,800 with 
he may neither earn nor lose 


of x. 


s that will be obtained in a 


hrow with an ordinary dice. 
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Ex. 8-3. From a bag containing 2 ‘20P’ coins and 3 ‘25 P” coins, a person is allowed to 
draw 2 coins indiscriminately. Find the value of his expectation. 


Prob. of drawing 2 ‘25 P’ coins 


Sol. Prob. of drawing 2 ‘20 P’ coins 


Prob. of drawing 1 ‘20 P’ coin and 1 ‘25 P’ coin 


Fey 1 
* Se, 10 
fie og 3 
Sa 5 
“ce 3 
2. 10 


The person gets 40 P, 45 P and 50 P in three cases respectively. 


.. Expectation of the person 


= 40 4. +452 +502 
10 5 10 


4+27+15= 46P 


Ex. 8-4. A person draws 2 balls from a bag containing 3 white and 4 red balls. Uf he is 


to receive 10 P for every white ball which he draws and 20 P for each red ball. Find his 


expectation. 


Sol. Three different possibilities are : 
(i) The person draws 2 white balls. In this case he gets 20 P and the prob. of this happening 


this happening 


Meg, al 


i (ii) The person draws 1 white and 1 red ball. In this case he gets 30 P and the prob. of 


*6) x 4c, _ 12 
Tos 21° 


(iii) The person draws 2 red balls. In this case he gets 40 P and the prob. of this happening 


“. Expectation 


= 1903430124 40.6} 
21 

ome {60 + 360 + 240} 
21 


1 
= — (660 
Th ) 


= 220 -31P. 
7 
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Ex. 8-5. Three urns contain respectively 3 green and 2 white balls, 5 green and 6 white 
balls and 2 green and 4 white balls. One ball is drawn from each urn. Find the expected 
number of white balls drawn out. 


Sol. Let x be the number of white balls drawn. Then possible values of x are 0, 1, 
2 and 3. 


Let po, P,, Pz and p3, be the probabilities of x taking these values respectively. 
Now Po = prob. of drawing all the three green balls. 


~3 35 21 
511 6 IL 
Different possibilities of drawing 1 white and 2 green balls are : 
lst urn 2nd urn 3rd urn 
W G G 
G W G 
G G W 


Where ‘G’ denotes the green ball and ‘W’ the white ball. 


pS £4 oe sae fo es Jee = 
5 116 511 6 5 11 6 165 


3 6425 4 2 6 2 


Simil 2.3.04 Peas 42.6 2 
3 ee St 6 S16 5 1 6 
_ 68 
165 

2 Ak 
PAS il 6 35 
ee ee a eee eee 


Sen o : +3.—= ‘ 
11 165 165 55. 165 


Ex. 8-6. What is the expectation of the number of failures preceding the first success in 


an indefinite series of independent trials with constant probability p of success ? 
Sol. Let x be the number of failures preceding the first success. Then x can take values 
05.1, Qissisdieccedscaee Resdesicisiincatis 
with respective probabilities 


PiOPG Dives GPs 
E(x) = O0.p+l.qp+2.q7p+...tn.g" pt... 


= pq(1+2qt....tng” +...) 


= pq(l-q) 
= PT - gi p. 
P 


Ex. 8-7. A makes a bet with B of Rs. 5 to Rs. 2 that in a single throw with two dice he 
will throw 7 before B throws 4. Each has a pair of dice and they throw simultaneously until 
one of them wins, equal throws being disregarded. Find B's expectation. 


MATHEMATICAL EXPECTATION 


Sol. Prob. of getting 7 ina 


and prob. of gett: 


Since total prob. is unity, p 


Now A wins if he throws 7 1 
not throw 7 or 4. 
«. Prob. of A winning in fi 


and prob. of B winning in | 
...Prob. of none-winning i 


Now A wins in second tri: 
neither 7 nor 4 in second trial. 
“. Prob. of A winning in s¢ 


Similarly prob. of B winni 


prob. of A winning in thirc 


prob. of B winning in thirc 


and so on. 
.. A’s chance of winning 
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tite balls, 5 green and 6 white 
each urn. Find the expected 


iossible values of x are 0, 1, 


ese values respectively. 


‘Is are : 
‘durn 
G 
G 
W 
ie ce 
5 11 6 165 
2.6 2 
5 11 6 
8 _ 266 
55° 165° 


preceding the first success in 
dility p of success ? 
ccess. Then x can take values 


ingle throw with two dice he 
y throw simultaneously until 
cpectation. 
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Sol. Prob. of getting 7 in a single throw with two dice 


oe 
6 
: 1 
and prob. of getting 4 = 5: 
Since total prob. is unity, prob. of throwing neither 7 nor 4 
a 
6 12 4 


Now A wins if he throws 7 but B does not throw 7 or 4 and B wins if throws 4 but A does 
not throw 7 or 4. 
.. Prob. of A winning in first trial 


sted 
6 4 8 
and prob. of B winning in first trial 
fi 
12 4 16 
...Prob. of none-winning in the first trial 
bay eae oe Ss 
8 16 16 


Now A wins in second trial, if in first trial none wins and he throws 7 but B throws 
neither 7 nor 4 in second trial. 
.. Prob. of A winning in second trial 


Similarly prob. of B winning in second trial 
oe eee eee ee 


and so on. 
«. A’s chance of winning 
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ee ee 


and B’s chance of winning 


2 4. ; 15, 
Now A gets Rs. 25 =3> if he wins and pays Rs. a? are if he loses. 


B’ tati wo 2s oe 
S$ expectation a3 7 


Ex. 8-8. A coin is tossed until a head appears. What is the expectation of the number of 
tosses. 


1 
Sol. Prob. of getting a head in a toss = 2 = Prob. of getting a tail in a toss. 


Let x be the number of tosses until a head appears. Then x can take values 


1 
When x takes value 1, head appears in very first trial and the prob. for this is 3" When 


x takes value 2, first trial results in tail and second in head. So by compound prob. theorem, 


2 3 
prob. that x takes value 2 = (3 . Similarly prob. that x takes value 3 = (3) and so on. 


Therefore, expected value of x 


8.2. Indicator Function (for discrete variable) 
For a random variable x indicator fuction is defined by 


: 1 ifx=x; 
To, xppomnt,) OD = {0 ees — @=12......4) 
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Discrete density functio 


Let x be a random varial 


f,() of x is defined by 


f,.@) is called discrete d. 


f,©) is a function with ¢ 
Remark: (i) By using in 


J 


(ii) Cumulative distributi 


and converse formula is 


-e.g., Let x denote the numb 


1, 2,.......6 and probability ass 


x0 


Then PF 


using etc. remark 


and by using equation (3) 
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MATHEMATICAL STATISTICS : ‘ ; : 
i ea Discrete density function of a discrete random variable. 


Let x be a random variable with distinct values x,, x ,......X,- The density function 
f,() of x is defined by 
_ JP(x=x,) if x= x; 
£0) {0 ifxex, (=1,20...n) 


f.©) is called discrete density function of x. 


f,@ is a function with domain real line and range the interval [0, 1]. 


Remark: (i) By using indicator function, 


5 
3 , if he loses. 


n 
LO)= > Pasx asc) (1) 
i=l 
sceiten a euaner a (ii) Cumulative distribution function is given by 


F(x) = > fx) (2) 
isx; SX = 
ag a tail in a toss. and converse formula is 
: can take values f,.(%)) = F(x) - » F(x; —h) (3) 
0<h>0 i 


ieprob Aon tieds > When -e.g., Let x denote the number obtained in rolling a single die. Possible values of x are 


i 
6 and probability associated with each value is 6 


'y compound prob. theorem, » ares 
fy ; 
value 3 = (3 and so on. f,.(x) = z Lich ace) 
or 
6 
1 
—1)(x 
2 6 wt ) 
. 7 
| FQ) = Doe lacxsisy() + [6<x<0)(%) 
i=t 
Sj 
ance FBS) = Dag lisassien 5) 
i=l 
se 
ra 
using etc. remark 
1 
LOS AES I 6 [using equation (1)] 


and by using equation (3) | 
3) = F,3)- )F,3-h) 


eke n) . ; 0<h20 
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= F,(3)-F,(2) Ex. 8-10. An urn contains 
‘ : then a fair coin is tossed the nt 


3 2 = 1 : ; the expected number of heads. 


Sol. Let B,, By, B; be the 
Ex. 8-9. A coin is tossed until a head appears. Let x denote the number of tosses. Find . 
(i) density function of x 
(ii) mean and variable of x 
(iii) moment generating function of x. 
Sol. For (7) and mean see Ex. 8-8. 
To find variance: 


d 


Let x denote the number « 
Possible values of x are : | 


S=E(x*) =1 + 422+ 43? ee Lieu PC. 
2 2 
ie 4 9 16 25 
= a zt ra at 57 ese soun’ 
S 1 1 1 1 
= L435 49. +16. + lidegbede dé 
Sem pape ae eg eg 
2) 2 2 3 24 ae ae Now P(x =1/ B,)= prol 
S 1 1 1 1 1 
ie Ria aoa Tr? 3st Mites 
gee ee corre are — 
4 2? 2? 23 “94 25 
1 1 1 1 1 1 
Subtracting s(7-4] = BoP oa a oe Sleek ees 
x= 
1 1 1 1 
= 77 2 ee baed 
1 1 
= a area em P( 
2 
S=12 
Var (x) = E(x?)- x? ate 
= 12-4=8 . 
M(t) = Efe} P(x=2 
=e P(x =! 
x=1 2* 
' PC 
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tote the number of tosses. Find 


++ Aa bes, 
6G ioe evetyed 
65+ esate de 

7 - +9554 indies 
PG +955 Heeseeee 
Ty 

at sw ceeds 
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Ex. 8-10. An urn contains balls numbered 1, 2, 3. First a ball is drawn from the urn and 
then a fair coin is tossed the number of times as the number shown on the drawn ball. Find 
the expected number of heads. 


Sol. Let B,, B,, B, be the events that balls numbered 1, 2, 3 are drawn respectively. 


P(B,) = P(By) = P(By) -; 


Let x denote the number of heads. 
Possible values of x are : 0, 1, 2, 3. 
3 
P(x=1) = P (B,Ax=)) 


3 
= PB, Ax=1) 
i=} 


3 
= >) P(B))P(x=1/B;) 


i=] 


Now P(x =1/B, y= prob. of getting head when coin is tossed once 


_i 
2 
P(x =1/B,) = prob. of one head when coin is tossed twice 
2 
a 
2 2: 
P(x =1/B,) = prob. of one head when coin is tossed three times 
3 F 
5 (iV 2 
2 8 
Pe re sre oe a 
ry 3 4S oF a8 
Bee rey re ae 
24 24 
P(x =2/B,) =0 
11 1 
P(x =2/B,) A 3 4 


2 
I 
P(x =2/B3) = als 


i 
x 
™“ 
i 
N 
») 
as.) 


- P(x =2) 
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3 
>" P(B;) P(x =2/B;) 


i=] 


1 1 3{_ 5 
= ~{O04+—4+—7 = — 
3 4 8) 24 


P(x =3/B,) = 0=P(x=3/B,) 
1 
P(x =3/B,) = = 
8 
P 3 = Lave ee 
Oars ae 4 
E(x) = 0.P(x = 0)+1.P(x =1)+2.P(x = 2)+3.P(x = 3) 
aptlg 4 got 
4° "24° 24 


= 1, 
Ex. 8-11. /fx has a distribution given by 
P(x =0) = P(x =2)= p and P(x =1)=1-2p 


1 
for 0<p< 2° for what value of p, variance of x is maximum. 

Sol. x= E(x) = 0.p+1.1-2p)+2.p 

=1 
E(x?) = 0?.p+1?.1-2p)+2?.p 
= 1-2p+4p=1+2p 
bo = var (x) = E(x7)- x? 

= 1+2p-1 
=2p. 


1 
It will be maximum at p = x 


Ex. 8-12. Consider an experiment of rolling of two six faced die. Let x denote the 


absolute difference of the upturned faces. Find the density function of x. Also find E(x). 
Sol. Possible values of x are : 
0, 1, 2, 3, 4, 5. 
For x= 0, different possibilities are 
(1, 1); (2, 2); (3, 3); (4, 4); (5, 5), (6, 6) 


.. prob. for x =0 is 0°. 
36 


For x =1, possibilities are : 
(1, 2); (2, 3); (3, 4); (4, 5); (5, 6) 
(2, 1), (3, 2), (4, 3), (5, 6), (6, 5) 
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.. prob. for x =1 is a 
36 


Similarly other probabilities a1 


E(x) 


Ex. 8-13. A coin is tossed four ti 
immediately by a tail. Find distribu 
Sol. Possible values of x are : 0 

For x = 0: Possibilities are 
HHHH, THHH; TTH, 
P(x =0) 


For x = 1: Different possibilitie: 
(HT) HH; (HT) TH; | 
H(AT)H; HAT): Ti 
HH(AT); THAT), T 


P(x =1) 


For x = 2 : Only possibility is 
(HT) HT 


P(x =2) 
E(x) 


E(x) 


var (x) 


Ex. 8-14, A and B throw with o1 
player who first throws 6. IfA has th 
Sol. A can win in Ist, 3rd, 5th..... 


1 (2) 4 es 
6° 6 6” 6 e anes 


.. A’s chance of success 
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= 1)+2.P(x = 2)+3.P(x = 3) 


)=1-2p 


ix faced die. Let x denote the 


unction of x. Also find E(x). 


6, 6) 


) 
i) 
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._ 10 
“. prob. for x =1 is —. 
36 


Similarly other probabilities are : 


10 . 8 6 4 2 
= 0. +1— +2.—43 442454 
EC) 36 "36 "36. 36S 36 36 


Ex. 8-13. A coin is tossed four times. Let x denote the number of times a head is followed 
immediately by a tail. Find distribution, mean and variance of x. 
Sol. Possible values of x are : 0, 1,2 
For x = 0: Possibilities are 
HHHH; THHH; TTHH; TTTH, TTTT. 


5 
P(x =0) = 16 


For x = 1: Different possibilities are : 
(HT) HH; (HT) TH; (HT) TT: 
H(H1T)H; H(HT)T; T(HT)H; THT)T 
HH(HT); TH(H7); TT(HT) 


Pica) = 2 
Soa 
For x = 2 : Only possibility is 
(HT) HT 
P(x=2) = VY, 


E(x) = 0.—+1—+2. 


E(x*) 


var (x) = E(x?) [E(x]? 


7 9 = 5 


8 16 16° 
Ex. 8-14. 4 and B throw with one die for a prize of Rs. 11 which is to be won by the 
player who first throws 6. IfA has the first throw, what are their respective expectations ? 
Sol. A can win in 1st, 3rd, Sthy...... trials with respective chances 
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2 eee 
~ 6), 25] 1 
36 
Since there are only two players, and total probability is unity. B’s chance of success 
pu Ora 
“Hou 


6 
A’s expectation = l x11=Rs. 6 


5 
and B’s expectation = a xll=Rs. 5. 
Ex. 8-15. [fx is a random variate which assumes values 1, 2, 3, 4... with respective 
probabilities given by 
P(x =k)=q*"'p, qtp=1, 
find E(x) 
Sol. 


k-1 
BQ) = aka? 
k=1 


p> ka’ 
k=1 


pil +2q+ 3q°+...} 


p(l-q)* = 


aie 


Ex. 8-16. Let a random variate x take the values 
k 
x, =(-1) =, ae ee 


with probabilities p, =2~*. Find E(x). 


| 
M 
= 
> 


Sol. Total prob. = 
kel 
Dy Bae 
cr 
_ 1/2 a 
1-1/2 
= 2 
k -k 
E(x) = xe) ae 
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Ex. 8-17. 4 random variable: 


; 1 
proportional to 3 Find E(x). 


Sol. Let P(x =n) = x 
3" 


Where / is constant of propc 


Total pr 


Since total prob. = 1, A is giv 


~~ 


MATHEMATICAL STATISTICS MATHEMATICAL EXPECTATION oe; 
4 >} (-1)' 
1 rm * 
1 
1.1 J] 
= ~[+—-—34— 
lity is unity. B’s chance of success 2 a4 
1 1 1 
= -|l->+—-—+4. 
23 4 
= —log 2., 


Ex. 8-17. A random variable x can assume any positive integral value n with a probability 


1 
3 proportional to 3 Find E(x). 


alues 1, 2, 3, 4,...+ with respective n 
Sol. Let P(x=n)=—, 
3 
r=], Where A is constant of proportionality. 
= 1 
Total prob. = h 3" 
n= 
1 1 1 
= A4—-+— + 
(3 a 8 
= 41/3 = 
1-1/3 2 
Since total prob. = 1, A is given by 
te. 
} A. 
2 
=>A =2 
2 
P(x=n) = 5" 
o 1 
E(x) = 2D In 
n=] 
I 1 
= 24-4+2.—+3,—+4H..... 
. 32 33 \ 
Pisseees = ke i424 +3.— + 
3 3 2 


Ul 
wiry 
— 
I 
Wo | 
1 
i) 
i] 
cs) 
aes 
Liw 
N 
it 
N[w 
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8.3. Laws of Expectation. Two basic laws of expectation are 
(1) E(x+y) = E(x) + EQ) 


(2) E(xy),= E@EQ) 


provided x and y are independent. 
Proof. (1) Let x and y be two stochastic variates with probability distributions 


a ik aaeeee ‘a and ae eerres Terre a 
P>\ PiP2-Pim DOP Py ecceeeeeeees P 
Let zZ=xty 

Then z will also be a stochastic variate. 

Let Zi = XiYj 


Let pj be the probability of z taking a value Zj- 


Let A, be the event that x takes the value x; and Aj be the event that z takes the 


value Zj- 
Then (A,) = (Aj + Aja +--+ Ain) 
s. P(A;) = P(Ay + Aja t----tAin) 
Since z can take only one value at a time the events Aj, Aj2,-.--- A,, are mutually 


exclusive. 
. By total probability theorem 


P(A;) = P(Aj,)+ P(Aj2)*---++ +P(Ajn) 


ie., Pi = Pat Pinte Pin 
i= 1, 2,.....m 
Similarly P, = Pij + Prjt--+Pmy fa 1 2intenaane n. 
in n 
Now E(x+y) = E(2)= >) > iP a 
i=l j=l 
m n mon m n 
= > (x; + y;) By => xPy +) Dy jPi 
i=l j=l i=l j=l i=l | j=l 


m n 
= SoxfPa + Pinot eeees + Pin} + > y/{Piy + Patt Pm} 
i=l F jel 


m n 
= Yo xiPi +) y Pj 
i=l jel 


= E(x)+ E(y). 
(2) Def. Two stochastic variates are said to be independent, if the probability of either 
taking a particular value does not depend on what value the other variate takes. 
Let x and y be two stochastic variates with probability distributions. 
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x— be 
P—\ Pip 
Let 
Then z will also be a stocha 
Let 


Let Py be the probability o 
by compound probability theore 


Pi = 


E(xy) = E(z) = 


Remark : These laws can b 
X1,Xq...x, be nm chance variates, 
E(x, +X, ++X,) = 

and - E(x, x2...X,) = 
Ex. 8-18. Find expected valu 


Sol. Let x; be the number c 


Therefore, expected value o: 

= prod 

This because x, xp...... x 
independent of the number obtai 


But expected value of x; =- 


Therefore, expected value o} 


Ex. 8-19. Find the mathemat 
dice together. 


Sol. Let x; be the number o: 
will be 
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Peat tae ate 


ion are 


yrobability distributions 


ij be the event that z takes the 


s Aj, Ajz,-+-+Aj, are mutually 


Ain ) 


xP y +), Dia 
jal j 


i=l | j=l 


n 
}+ > 9 {Pyj + Proj t---:+Pmy} 
jel 


ident, if the probability of either 
e other variate takes. 
‘distributions. 
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sae ~ibveades an BAC ee Yn 


P\ Pi Pave Pm p->\ PP»...... P, 
Let Z=xy 
Then z will also be a stochastic variate. 
Let zy =X; yj 


Let Py be the probability of z taking a value %jj- Then since x and y are independent, 
by compound probability theorem. 


Py = P,P; 


I 
Ms 

N 

=: 

S 


E(xy) = E(z) 


m 
= > (x;y ;P;P;) 


i=l j=l 


Ee) 


E(x)E(y). 
Remark : These laws can be generalized to any finite number of variates namely: if 


X,,%X2...X, be n chance variates, then 
E(x +x) ++x,) = E(x,)+ E(x) +...+ E(x,) 


and E(x,X2...X,) = E(x) E(x2)...E(x,) provided x’s are independent. 
Ex. 8-18. Find expected value of the product of points obtained on rolling n dice together. 
Sol. Let x; be the number of points obtained on ith die. Then product of points on n 


Therefore, expected value of the product of points obtained 
= product of the expected values of Xx; 


This because x,,x5......x, are independent as number obtained on one die is 
independent of the number obtained on other dice 


7 
But expected value of x; = 5 (See Ex, 8-2) 


Therefore, expected value of the product of points obtained 


Ex. 8-19. Find the mathematical expectation of the sum of points obtained on rolling n 
dice together. 


Sol. Let x; be the number of points obtained on ith die. Then sum of points on n dice 
will be 
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a eS EEE Ee 
Therefore, expected value of s 
= sum of the expected values of x, x3.....X, 


ef 
Now expected value of x; = 5 (See Ex. 8-2) 


‘Therefore, expected value of s 


U 
Sa eS 
N[~ 
ene 

= 


Ex. 8-20. If p; be the probability of success for ith trial, find the expectation of the 


number of successes in n independent trials. 
Sol. Associate with every trial a variable which has the value ‘1’ in case of success and 


the value ‘0’ in case of failure. If x,, x......x, be the variables attached to trials 1, 2,...n, the 
number of successes in n trials is given by 
Mm = Xy+XgQt.4+X, 
E(m) = E(x,)+ E(x )+..... FE(x,) 
Since x; can take only two values ‘1’ and ‘0’ with respective probabilities p; and 

ip, its expectation is given by 

E(x;) = Lp; +0.d— p;) 

= Pi 


E(m) = py t+ pate +Pr- 
Ex. 8-21. Find the expectation of the number of white balls among c balls drawn from 
an urn containing a white and b black balls. 
Sol. Associate with every ball a variable which has the value ‘1’ if it is white and the 


value ‘0’ otherwise. If x,,x,.....x, be the variables attached to c balls drawn, the number 
of white balls is given by 
M = X, +X teutX, 
E(m) = E(x,)+ E(x2)+....+E(x,) 


Now the probability that the ith ball drawn will be white when nothing is known of the 
other balls 


a a 
irene 
E(x) = O46 | 


oe hip alle 
Wah OF alld 
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_ca 
E(m) = 

at 

Ex. 8-22. A box contains 2" ti 


i= 0, bcs. n. 
A group of m tickets is drawn atr 
of numbers on them. 


Sol. Let x,, x2,......x,, be the nu 


Consider ~x,. 
Its possible values are 0, 1, 2,..... 


Since there are "c, tickets bearii 


n 
ent 
Q" 


E(x, +Xg+ ateee +X) : 


(ii) E(x;] 
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(See Ex. 8-2) 


al, find the expectation of the 


alue ‘1’ in case of success and 
‘s attached to trials 1, 2,..., the 


spective probabilities p, and 


alls among c balls drawn from 


value ‘1° if it is white and the 


ito c balls drawn, the number 


when nothing is known of the 
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i=O,1,....... n. 


Pe _ ca 
7) a+b 


Ex. 8-22. A box contains 2" tickets among which "c, tickets bear the number i, 
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A group of m tickets is drawn at random. Find the expectation and variance of the sum 
of numbers on them. 


Sol. Let x,, x2,......x,, be the numbers on m tickets drawn. 


Consider x,. 


Its possible values are 0, 1, 2,.....n. 


Since there are "c; tickets bearing number i, probabilities of these values are 


(ii) 


E(x, +X2+ 


E(x,) = 


weced +X) 


"Cy "cy "Cy 

Qn ? 2" a on 

1 n : n ; 
a Co + 1"cy +2" cpt... +n"c,} 
1 n(n-1) 

— <Lnt+2,.———u...... + 

2. 2! nh 


n 
—(1+)"!= 
ar ) 2 


E(x,)+. 


n 


wet E(X,) 


Iv... 
5 GD "c, + E(x,) 
i=0 


sa.” Cz +32." c3+ 


nin-}) 1) 
2” 


nn-\) 1) 
2” 


(+4)? + 


aces t+n(n—1)."c,} + Ms 


2 
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To find cov. (x,,x,), simultaneous values of x, and x, are to be considered. 
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_ an-l Fale n(n+1) 
4 2 4 


Here prob. of x, taking value i and x, taking a value / is 


and 


Let 


E(x,x;) = 


1 = -n - on n ~ . 
FQ") vi. ofA cj -i. of Sorat 0 
i=0 


1 ‘ nh . 2 
——_—— i" ¢,-2"-) ie; 
2"(2" -1) p ee) 2, ; 


"Cj"; 

per mere | 28 

2"(2" -1) 

"¢;("¢, -1) fiz i 

Moray 4, 

nt n,n n n n 
Vii Ci Cc; 2 c;.(¢; -1) 
j=0 2” (2” -1) a0 2"7(2" -1) 
jzi 


i=0 j=0 


m 


by 


mn(n+1)— m(m—1) nt -n-l} 
4 2"-1 4 
mnr(n+tb) n m(m—I)n 
———* + m(m—- 1). — - ———— 
-4 4 42"-)b 


non _n(ntl) 
ee (a 4 


4 


n(nt1) 1 n? n nn+l) 
a > > — ,— .25 -————_ 
4 kot mali 
kel 


k=]. 


| 
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I 
iS] 
ge 
es 


Var (S) 


mn 


Ex. 8-23. Balls are taken on 
until the first white ball is draw. 


preceding the first white ball is 


Sol. Let x be the number c 
values of x are 0, 1, 2,....5 
Prob. of x taking the value ‘ 


= prob. of drawing a white t 


Prob. of x taking the value ‘ 
= Prob. of drawing a black 1 


in Se 


Prob. of x taking the value ‘ 
= prob. of black balls in firs 


third di 
and.so on. In general, prob. 


b 
a+b 
.. Expected value of x 
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unt+h 
4 


nd x, are to be considered. 


Jue j is 


12 ee Gr ~1) 
2"(2" +1) 


n 
-n ‘2 n,n 
c,-i. “f+ ) "ee; -1) 
i=0 


[n® ae 


[4 4 
"—n-l} 
mim —1)n 


4(2” ~1) 
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mn nm(m-—1) 
of eR 1 2 
=] {nt+l+n(m-1} EES 


mn(mn + 1) 7 nm(m—1) 
4 4(2” -1) 


E(S?)-{E(S)}’ 


Var (S) 


_mn(mn tt) nm(m=1) (a) 
4 4(2"-1) \ 2 


mn _ nm(m — 1) 
4 4(2"-1) 
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Ex. 8-23. Balls are taken one by one out of an urn containing a white and b black balls 
until the first white ball is drawn. Show that the expectation of the number of black balls 


b 
preceding the first white ball is tb 


Sol. Let x be the number of black balls drawn before first white ball. The possible 


values of x are 0, 1, 2,....b 
Prob. of x taking the value ‘0’ 


= prob. of drawing a white ball in first draw = ae 
Prob. of x taking the value ‘1’ 
= Prob. of drawing a black ball in first draw and a white ball 


: b a 
in second draw = ep ep ey 


Prob. of x taking the value ‘2’ 
= prob. of black balls in first two draws and a white ball in 
iid aeye 
mits Geb gebok arpa 
and so on. In general, prob. of x taking the value ‘r’ 


b b-1 b-r-1 a 
ak yaa = 
.. Expected value of x 


at+b-r—-1 a+b-r 


opis oe b(b-1) a 
= 0. +1. ; +2. 7. 
a+b a+b a+b-1 (a+b)(at+b-1) (a+b-2) 
~~ (a+b)(a+b-})....(atb—r—1) a+b-r 
Gee | ae) Jed en are 
atb 


at+b-] (a+ b-1l)\(a+b-2) 
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pp O=WO=Dn(b-T= 
(a+ b-1)(a+b-2).....(a+b-—r) 


b= 1)(b-2).....(b-r=1) 
ree Ue ae b=a b= 2).as bon) 


~ (a+b-l(atb-2).....(a+b-r-1) 


__[Ar+B}](b-1)(6-2).....(b-r) 
(a+ b-1)(at+b—2).....(at+b—-r) 
a. r= [A(r-1)+ B)(a+b-—r)-[Ar+ B](b-r) 
Equating co-efficients of r 


1= A(a+b)+A-A.b 
ke aed 
7 at+l 
Equating terms independent of r 


0 = (B- A)(a+b)- Bb 


- : R= a+b 
a 

— a+b — 

Alri 2*8|6-y0-2)..6-7=1 

nS ae Ba Se a es 


(a+b-1)(at+b-2)...(atb—r-l) | 


Als 222) (b-110-2).(6=1) 
~ (a+b-l(at+b—2)...(a+b—r) 


a a 


afi+225]6-n [222 ]o-0 
(atb-l)—s (at Ita +b) 


.. Expected value of x 


+> 
a+bla+b-1 (a+l(a+b-1) 


ipa See) 
ab ; : - 


= atb (a+1)(a+b-}) 
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Ex. 8-24. A bag contains a coii 
value is m. A person draws one a 
expectations. 

Sol. Let the total number of c: 

Then the average value of coi 


Now prob. of drawing a coin 


Since in first draw any coin 1 


If the coin M does not appear 
. Chance of second draw = ( 


Since there will be (n—1) coil 
draw, when it is known that secon: 


.. By compound prob. theore 


.. Expectation from second 4 


Similarly expectation from th 
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r-1l) 
r—1) 


»~1)(atb—2).....(a+b-r) 
‘b-r) 


\(b-2)....(b-r 1) 


).(atb—r—l) — 


+) (b-1)(b-2)....(b-r) 


(a+1)(a+b--1) 


i+b 


+) (6-1) 
oad ee 
I)(a+b-}) 


+ b)(b-1) 


—a_ 
b—-1) 
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b 


a 


+1° 
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Ex. 8-24, A bag contains a coin of value M and a number of other coins whose aggregate 
value is m. A person draws one at a time till he draws the coin M. Find the value of his 


expectations. 


Sol. Let the total number of coins in a bag = n. 
Then the average value of coins other than that of value M 
m 


n 


-1 


Now prob. of drawing a coin in first draw 


Es 
n 


Since in first draw any coin may appear, expectation from first draw 


If the coin M does not appear in first draw, second draw is to be made. 
. Chance of second draw = Chance of not drawing the coin M in first draw 


{él pe: ar +...4(n~1) times 


n-l 


M+m 


n 


1 


n-l 


n 


n 


r 


Since there will be (n ~1) coins before second draw, chance of drawing a coin in second 
draw, when it is known that second draw is to be made 


.. By compound prob. theorem, prob. of drawing a coin in 2nd draw 


.. Expectation from second draw 


1 


n 


-1 


Similarly expectation from third draw 


n-l 1 
n n-l 
z 
n 
Aro adiaoe 
n n- 
2 {4+ Gam 
n n-l 
n-l n-2 1 
n n-l n- 
+ {m+n} 
n n-l 
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and so on. 
Finally expectation from last draw 
_M 
n 


.. Total expectation 


= Z (M +m) + i pasa + wasn +...4M 
n n-|l n-l 
= Mah mete 4 BB. J } 
n-l1 n-l n-l 
= M+ m{(n—-1)+(n—2)+....41) - 
n(n—-1) 
SMe OED 
n(n-1) 2 
= M+—. 
2 


Ex. 8-25. Show that E|x|2|E(x)| 
Sol. | E(x)|=|Zpx| 
< 3|prl= Zplal= Bla, 


Ex. 8-26. Show by an example that the mathematical expectation need not be finite. 
Sol. Consider the prob. dist. 


e! 


P(x) =—, BO, 2 stots, 0 
x! 
oO e! 4 fay 
Here E(x!) = Dy tite Dy: 
x=0 7° x=0 


which is not finite. 


Ex. 8-27. Show that E(x”) > {E(x)}?. 


Sol. We have 

E(x-x)* = Zp(x-%)* 20 
ie, E(x? +x? ~2x%) 20. 
ie., E(x*)+%? —2XE(x) 20 
ie, E(x?) +x? -2%? 20 


B(x?) 2%? = (EQ)! 
Ex. 8-28. For any variates x and y show that 


fee - yp < fe}? Fa fey}? 


MATHEMATICAL EXPECTATION 
elaine nS ES 


Sol. We have (ax—y)* 20, 


=> 


= 


Put 


> 


Elax-yy 20 (v0 


a? E(x") + E(y?)—2ai 


{E(xy)]} 
E(xy) < 


E(xt+y 


{E(x + yy 


Ex. 8-29. For independent « 


if and only if 
E 
Sol. If E 
Var ( 


Conversely, let 
Var (xy) = Var (x) Var (y) 


=> 


=> 


E(x? y?)— #97 = {B( 
E(x" )E(y?)-¥°y" = 
X°E(y?)+ E(x’): 
x*{B(y?)- 97} + 97{ 


x2 Var (y)+ y Var ( 
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Sol. We have (ax —y)? 20, for all real constants ‘a’. 
=> E(ax-y)’20 (- prob. 2 0) 


=>  aE(x")+E(y”)—2aE(xy) 20 


7 E(xy) 
+ {m+ nl aed ov ae E(x") 
res 
{E(xy)}" 2 2, 2AE(xy)}" 20 
—_—— .E Ei —_—_—_—— 
ee UH Gye aa 
. 
7 {E(xy)}” < E(x” )E(y’) 
1-1)+(n— 2)+....+1} 2 2 
> E(xy) $ yE(x")E(y*) 
n-1) Now E(xty)* = E(x”) +E(y”)+2E(xy) 
< E(x”) + E(y”) +24 E(x”)E(y”) 
2 
. {VeCs*) + veo} 
1 1 1 
= fecx+ yy? }? < {ec}? + {20}? 


Ex. 8-29. For independent and non-degenerate variates x and y, show that 
Var (xy)= Var (x). Var (y) 


' expectation need not be finite. 


if and only if 
E(x) = 0=E(y) 
Sol. If E(x) = 0= E(y), 
Var (xy) = Efxy}? -{E(ay)}? 
= E(x’ y*)—- {EE} 
E(x?) E(y’) 
= Var (x) Var (y) 


Lvs 


Conversely, let 
Var (xy) = Var (x) Var (y) 


=> B(x?y?)-3°9? = {E(x?)- 37} {B07)- 97} 

=> B(x? )E(y?)- #9" = E(x? )E(y?)-P Ey?) - PPE )+ 2H 
=> x*E(y?)+y7E(x’)-2%’y? =0 

= ¥°{E(y?)- 97} + 97{B?)-37} =0 


=> x Var (y) +9? Var (x) =0 
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Since x and y are non-degenerate, 

Var (x) >0, Var (y) >0. 

~() » x=y=0. 

8.4. Moment Generating Function. The moment generating function (m.g.f.) of the 
chance variate about the point ‘a’ is defined to be E aed and is denoted by M(t), 


where t is the real parameter. 
Cumulative Function. The cumulative function about x = a is defined by 


K,(t)= log M a(t). If K,(t) can be expanded as a convergent series in powers of t viz., 
2 B 
K,@=k Itty ees ry +.. 
the co-efficients k,,k, etc., are called first cumulent, second cumulent etc., of the dist. 
Ex. 8-30. (i) Show that M,(t) = e™™ M(t) 


(ii) Discuss the effect of change of origin and scale on M.G.F. 

(iii) Show that the m.g.f. of the sum of ‘n’ independent variates is the product of their 
moment generating functions. 

(iv) Show that 


ce} Pa 
M,(t) = ie @— 
r=0 . 


Sol. (i) M,(t) = Efe} 


E{e* .e~“} 


e "Efe*}| 


= ,~a 
= M(t) 
wc) The transformation waa to change of origin and scale is 
x-a 
h 


where ‘a’ corresponds to change of origin and ‘h’ to change of scale. Both a and h are 
constants. X is a new variate to which x transforms. 
x=athx 


M(t) of x= E{e™} 


X= 


= E { el (a+hX ) } 

a e" E{e*) 

= e™ {M(th) of X} 
(ii) Let x,, x5,....... X, be n independent chance variates. 
Let X= FX te tN, 


M,(t) of X = Efe™} 
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(iv) 


Remark. Since from the | 
generating function. 

Ex. 8-31. (i) Discuss the 

(ii) Prove that the r-th cu 
of the r-th cumulants of the vu 


(iii) Show that k, = j,k 
Sol. (i) By Ex. 8-30 (i) 
Ko. 


Let ky, ky......and ki, kj 


2 
t 
(1) => kyttk 


-. Except first all cumu 
first) depend on scale. 
(ii) In Ex. 8-30 (iii) jak 


- 


Ky(i 
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=—E {elltit-t%a)} 


= Efe™ eo ., e™} 


yenerating function (m.g.f.) of the E {e™} E{e™?}...E {e™} 
(‘." x,’s are independent) 


{Mo(t) of x,}...{Mo(t) of x,} 


i and is denoted by M(t), 


i 


ut x = a is defined by 


(iv) M,(t) = E fe} 


ergent Series in powers of t viz., 


2 
+... efleta-aet cae} 


ond cumulent etc., of the dist. 


2 
1+ E(x~a) +5 Baa"... 


mn M.G.F. 
nt variates is the product of their 


2 
= , t iJ 
= 14th; (a) +5) Ha (a) +... 


00 1! 
eu @)= 


r=0 


Remark. Since from the function M,(t), moments can be generated, it is called moment 
generating function. 

Ex. 8-31. (7) Discuss the effect of change of origin and scale on cumulants. 

(ii) Prove that the r-th cumulant of the sum of independent chance variates is the sum 
of the r-th cumulants of the variates. 

(iii) Show that k, = j,k, =p2,k3 =p; andky =p, ~3p,”. 

Sol. (7) By Ex. 8-30 (ii) 

Ko(t) of x = log {Mg (2) of x} 
= at+ log {M)(th) of X} 


= at+ Ko(th) of X ) 


of origin and scale is 


nge of scale. Both a and h are ; 
Let k,, ky... and kj, k4...... be the cumulants for x and X respectively. Then 


2 2 
(l) => kth, + ie = at +h (th) + ky EO eG 


= 
s 
| 


= athk; 


k, =Wk r22 
.. Except first all cumulants are independent of origin but all cumulants (including 
first) depend on scale. 


(ii) In Ex. 8-30 (di) aking: log _ 


Ko(t)of X = ){Ko(s) of x;} 


i=l 
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Let ky, kp... and ki, ky! seks be the cumulants of Xand x; respectively (i =1, 2....7) 


= | k, i; Dee 


(iii) By def., 
‘ K;(t) = log M;(t) 
2 
t 
= tof arias as eae | 
log ¢1+ e e e 
g gy b2 737 Bs + ahs 
(" H, = 0) 
5 ore Harper 
= y HM. 31 b3 4 Mgt Sele ens ~~) 1h +3 3+... a 
2 3 4 
fo, 3 sb t 2,t ; 
k(x)=0>5 k,(0)~x =0 
>k (0) =x 
ky = py 
k3 = ps3 
and ky = py ~3p,” 


Ex. 8-32. If x is a variate with zero mean and cumulants k,, show that the first ton 


cumulants |, and L, of x? are given by 


Sol. We have for x, 


ki = p,'(0)=x%=0 
ky = by 
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where [L,’s are moments about r 


Then 
H. of 
Ex. 8-33. Show that 
H 
Sol. K\ 


r 


{7 t 
>ktt+k, —+..4k, —+...2 
1 2 YT r r! 


Differentiating w.r.t. ¢ 


ky +kytt+...k, See. 
(r-D! 
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ind x, respectively (i = 1, 2....n) 


ts k,, show that the first roa 
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k; = 
k, a 


H3 


Ha - 


where 1,’s are moments about mean for x. 


Then 


Ex. 8-33. Show that 


Sol. 


i if 
=> kt+k, Bene area log 
: re 


Differentiating w.r.t. ¢ 


v7! 


3p” 


1, = first cumulant of x? 


E(x)? 


= 2 = 


lL = 


ky 


second cumulant of x? 


Hy of x? 


let= y= x? 


st 
| 


ut 


)= pH, =k, 


E(y-y)? 


E(x? 


E(x* 


~k,) 
)~2ky E(x?) + ky? 


Hg — keV +k," 


ba - ky’ 


= kat ay? ky? 


ky + 3k? ~ky? 


= ky +2k,’. 


r 


’ r-l 7 
we = Cis, oes hy 


K(t) = 


Hy + pott..t pe 


k, +kat+...k, ———+...= 


(r-1)! 


2 
> {s t+ktt+k, Siteks 


t 
L+pytt..tpy te 
r: 


r-] 


t 
(r-1)! 


j=l 


log M,(t) 


r 


oe 
(r-)) 


r 


a 
PB Set pa 


' , tr? LZ 
V+ ptt py —+...4p) —+4... 
2! r! 


| 


| 
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ru : 
= py + ugtt..t We +... 


(r-1)! 
r-l 
Equating Co-efficients of (rb! 
' p ; (r -1)(r -2) 
Bp = Ayhyy +ky(r—-Dpp_2 +h; er at 


r-l r-1 r-l . 
= Kipp + CykgMyg + Ck shy gt. Ck, 
‘ 
r-l 
= y Cj-yk jMy-;- 
j=l 


8.5. Characteristic Function (c.f.) 
The characteristic function of a random variate x is defined to be 


Efe} 
where f is real and i? =~1. It is denoted by $,(t) or simply (t) 
b(t) = E(e'} 


8.5.1. Properties of characteristic Function 
c.f. possesses following properties : 


(i) $(t) is defined in every finite interval and is continuous 


(ii) (0) = E(e°) =1 


Ss e" n(x) 


<>) 
<> p(x)=1 
=|cos (xt) +i sin (xt)| = ycos? (xt) + sin” (xt) =1) 


(iv) $(t) and $(-t) are conjugate fs. 
The relation between moments and c.f. is 


er ae 
u,’ = i) | a xo} 


Theorem 8.4.2. If the prob. f” is symmetrical about 0i.e., p(—x) = p(x), $(¢) is real 


(iii) |6@|= 


itx 
é 


p(x) 


it 
ell* 


CG 


t=0 


and even f” oft. 
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Proof. By def. 
o() 
since P(x) 
(2) 
Which 
Moreover, 
o(-t) 


=> isaneven f", 


Remark 8.5.3. C.f. has an advar 
m.g.f. may or may not exist. 


One of the simplest conditions 
(i) 0(t) is bounded and conti: 
(ii) 9(0) =1 


(iii) y(x.c) = [foe ~ zeit: 


is real and non-negative for all real. 
Remark 8.5.4. If there are two 
must be identical. 


.. Foragivenc.f. (¢), there is « 


is given by 
f(x) 
Result 8.5.5. If x,,x. are inde 
bx,41 (1) 


but its converse may not be true. 
Result 8.5.6. For Bivariate dist 


x.x, (t, i) ty) 
Here x, and x, are independer 
ox,.x, (4, ty) 


Ex. 8-34. Find the distribution 
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Proof. By def. 


o(t) = Le! p(x) (1) 


since P(-x) = p(x), (1) can be written as 


(t) = z{e™ +e} p(x) 
Which is real. 
Moreover, 


b(-t) = Le p(x) 
changing x to—y 


- die p(-y) 


y 


= Der) = 0) 
y 


defined to be : 
=>? isaneven f”. 


Remark 8.5.3. C.f. has an advantage over m.g_f. in the fact that c.f. always exists whereas 


nply $(t) m.g.f. may or may not exist. 


One of the simplest conditions for a given f” (t) to beac.f. are: 


(i) 6(t) is bounded and continuous 


(i) (0) =1 


ntinuous 
Cc eC a 

(ii) w(x.e) =f [oe-aer dtdz 
Oo 


is real and non-negative for all real x and all c > 0. 
Remark 8.5.4. If there are two distributions with identical c.f.s. then the distributions 
must be identical. 


. Foragivenc.f. (¢), there is only one distribution. The density f” of this distribution 


is given by 


Le —itx 
fa) = Je onde 
27 J—« 
Result 8.5.5. If x,, x. are independent variables, then 


O42, (t) = x, (1)-9,, (t) 
but its converse may not be true. 
Result 8.5.6. For Bivariate distribution (see Chapter 11) c.f. is defined by 


Oy (t,t) = Efe xth%)) 


) =1) 


Here x, and x, are independent if and only if 
$x,.x5 (t,, ty) = $,, (1).,, (ty). 


Ex. 8-34. Find the distribution for which the characteristic f" is 


0 


tOie., p(—x) = p(x), O(2) is real 
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etl Ge? 


Sol. The density f” is given by 


f(x) 


(ii) f() 


It 


MATHEMATICAL STATISTICS 


12 


“f —itx 
= [ie o(t)dt 


= ® vite oH ay 
TT Y-0 


oO eo 
“| | eel dt+ jewel 
tt 


0 oO 


=f e(e" +e dt 


1? _, 
=| e.cos tx dx 
nt 40 


2n Jn 
Put ¢+ix =z. 
1 --x ik 22? 
=—e? e? az 
2n -0 
1 ->x co - =z? 
=—e? 2 | e? dz 
2m 0 
1-3 1 a 2 
=—e? ~=| ey? dy Put + = 
nyo y 
"1 (t) 1-3 
=—e -z-/|r]= e 
Tt 2 2, V2n 
8.6. Generating Function. 
Consider infinite series 
Ag +Qjt +ayt? +...... (1) 


with real co-efficients and variable t 


If in some interval of t, series converges say to S(t), S(t) is called generating function 


of sequence {a; } 
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e.g., (i) the generating function « 


which is 


(2) the generating function of se 


which is e’. 
If x be a discrete random variab: 


series (1) gives 


This sum function £{¢*} is call 
M,(T) 

is obtained by replaing ¢ be e" in pl 
Ex. 8-35. Let P(t) be prob. ger 

(i) x-1 (fi) 3x (iii) Pa2n@)F 


Sol. Let D; 
G) p-g: f of (x-1) 
(ii) ps: f of (3x) 
(iii) Let In 
and O(t) 
Vn+1 
Qn — ns 
Sant” —S0dneit” 

n=0 n=0 

1 

> A) = {22 — do} 
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‘t)dt 


tldt 


C) 


it+ forterat 


Put t+ix =z. 


Zz 
Put — = 
2 y 


(1) 


‘t), S(t) is called generating function 
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e.g., (i) the generating function of sequence (0, 1, 1, 1,...) is sum of the series 
$eE 4 ede 


t 
which is Ts 
|-t 


(2) the generating function of sequence {5 is sum of the series 


2 
t 

l+t+—t... 
2! 


which is e’. 

If x be a discrete random variable with non-negative integers as values and 

a; = P(x =i) = p; 
series (1) gives 
Po + Pit + Pot? tee 
= E{t*} 
This sum function £{r*} is called probability generating function. Obviously 
M)(T) = E{e™} 

is obtained by replaing ¢ be e in prob. generating function. 

Ex. 8-35. Let P(t) be prob. generating function of a random variate x. Find p.g.f. of 
(i) x-1 (ii) 3x (iii) P(x 2n) (iv) P(x > nt+)) 


Sol. Let Pi = p(x =i) 
( p-g-f of (x-l) = E{e 

= 1 ee*) _ PO 

t t 

(ii) p-8-f of (3x) = E(t") = Eft’y"} = PO?) 
(iii) Let Qn = P(x2n) 

= Dat Ppyptereeeeee 
and O(t) = Io + qittqot’+.... 


Ant = Pnei + Prasat aise seen 


Gn- Invi = Pn N20 
ao oO oa 
n n n 
> aut PS ansit => Pal 
n=0 n=0 n=0 


= (0) -"(0(1)-4,) = P(t) 
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= 9, —tP(t) 
=> oO oar ro 
Now 90 =PotPit siee.aee =] 
_ 1-tP(t) 
LG amr ie 


(iv) is left as an exercise. 
Ex. 8-36. Find p.g.f. of 
(i) P(x <n) (ii) P(x Sn) 


{Ans ea P(t) s ria} 
1-t 1-t 


Ex. 8-37. Let P(t) denote the p.gf. of a random variate x and Q(t) be the p.g.f. of 
P(x=2n). Show that 


. ul 
Q(t) = P(t?) + P(-t?) 


2 
Sol. Let P(x =i) = p; 
Then 
Pt) = DyPa? 
n=0 
and Q(t) = >) Pan #” 
n=0 


(“P(x =2n) = p2,) 


— Po + Pot + pat t+.... 


! 3 
2 2 2 
20(t) = (ns + pit? + pot + p3t? + pat vf 


1 3 
op. — pit? + pot — pst? + pyt?..... | 


2 7 
P(t?) + P(-t?) 


1 1 
P(t?)+ P(-t?) 
2 
Ex. 8-38. Let x denote the number of failures preceding the first success in an indefinite 


series of independent trials with constant prob. p of success. Find p.g.f. of x 
Sol. By Ex. 8-6 


Q(t) = 
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P(x=r)=q'p, 


P(t) = E(t”) 


tagen 


which exists only when qt < 1 ie. 


1. Let x be a random variate and | 


E(x - 
where V (x) stands for variance 
2. If‘a’ is constant, show that 
(i) E(a)=a 
(ii) E(ax) = aE(x) 
(iii) Var(ax) = a? Var(x) 
. Two fair dice are tossed. Find | 


4. A and B in turn toss an ordinar 
If A has first throw, what are tk 


’ 


1) 


5. Show that () p, =k), (i) o = 


6. Four coins are tossed. What is 


1 ; 
7. If a ey ee ae 


l 1 
P(y = -2) = —, P(y =10)=— 
(y ) 3 (y ) 3 


find E(x + y). Further, if x an 


8. In an objective type examinati 
are four answers out of which 


1 
the correct answer and a ot 
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{ans PO FO 
. l-rt 1-t 


variate x and Q(t) be the p.g.f of 


(° P(x = 2n) = po,) 


‘ 
t+ pt? +r 


3 
2 2 
t— p3t? + pyt..... | 


ing the first success in an indefinite 
cess. Find p.g.f. of x 
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P(x=r)=q'p, r=0,1,....... 


p(t) = E(t") 
= Sap! 
r=0 
= p\qt + gt’ +4 


= pat{l ay LY i ae eee } 


tasen 


which exists only when qt < J i.e. 


— 
. 


and 


I 

t<-. 
q 
EXERCISES 


Let x be a random variate and c, a constant. Show that 


E(x—c)? =V(x)+{E(x)-c}? 
where V (x) stands for variance of x. 


. If ‘a’ is constant, show that 


(i) E(a)=a 
(ii) E(ax) = aE(x) 
(iii) Var(ax) =a? Var(x) 


. Two fair dice are tossed. Find the probability distribution of the total score. 
. A and B in turn toss an ordinary die for a prize of Rs. 44. The first to toss a ‘six’ wins. 


If A has first throw, what are their expectations ? 
[Ans. 24; 20] 


k ky 
. Show that (i) p, =k, Gi) o = Jk, , Gi) Nh ea (i) 12-72 
2 


. Four coins are tossed. What is the expectation of the number of heads. 
[Ans. 2] 
If ee eee ee PGen== 
; 4° 2° 4 


1 1 l 
P =—2 =-,P =10 =-—,P =4 — a 
(y ) 3 (y ) 3 (y = 4) 3 


find E(x+y). Further, if x and y are independent, find E(xy). 


. In an objective type examination, consisting of 50 questions, for each question there 


are four answers out of which only one is correct. A candidate scores 1 if he picks up 


1 
the correct answer and 3 otherwise. If a candidate makes only.a random choice in 
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10. 


11. 


12. 
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respect of each of the 50 questions, find his expected score and the variance of. his 
score. 


. What is the expected number of double birthdays (two or more persons having the 


same birthday) in a group of n persons ? 
A coin is tossed until a head appears. Let x denote the number of tosses. Find 


D.af. of x. 
[Ans so 
2-t 


Show that 


ur-{< wc 
dt ie 


Let x be a random variable having m.g/f- er(e'-D_ Find E(x) 


Sol. M,(t) = ere'-1 
dM g(t) _ Me) yo! 
dt 3 
ats) = | Me} a 
dt.) 


Contin 


9.1. Continuous Variable 
It is the variable which car 
following x will be taken as contin 
Probability Density Functi 


variate value lying in infinitesime 
I(x) d&, is called probability den: 
The density function f (x) ha: 
(i) f(x) 20 Vx. 
(ii) J S(x) = 1 where the i 
variable x. 
In the following the density J 


Probability Differential. ‘ 
for a variate to lie in the interval 


Plas xsb) = 


Probability Curve. The cc 
curve or simply probability curve 
Distribution Function. The 


F(x) = 


is called the cumulative distributi 

The c.d.f. F(x) has the following , 
(i) FQx)=f(xj 20> FV 
(ii) F(—- °°) =0. 


fee) 
(iii) F(o)= | f(x)dx =1 
—o 
It is also sometimes denoted 
Any continuous differentiabl 
as c.d.f. of x. Then density functi 
Indicater Function : Indicé 


Tia, by (*) = 


Ex. 9-1. If fy (. ) denotes 
cumulative distribution function, 
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‘ted score and the variance of his 
(two or more persons having the 


1ote the number of tosses. Find 


Continuous Distributions 


9.1. Continuous Variable 


It is the variable which can take all possible values between certain limits. In the 
following x will be taken as continuous variable and will also be used to represent its values. 
Probability Density Function. A continuous function f (x).s.t. the probability of the 


variate value lying in infinitesimal interval x — ist orx + as can be expressed in the form 
J (x) dx, is called probability density function or simply the density function. 

The density function f (x) has the following properties : 

(i) f@) 20 Vx. 


(ii) J (x) = 1 where the integration is being extended to the entire range of the 


variable x. 

In the following the density function will be denoted by f (x) {or f, (x)}. 

Probability Differential. ‘f(x) dx’ is called probability differential. The probability 
for a variate to lie in the interval (a, b) is given by 


b 
Plas x<b) = [ f(x)dx, 


Probability Curve. The continuous curve y = f(x) is called the probability density 
curve or simply probability curve. 
Distribution Function. The function F(x) defined by 


Fax) = J fxd 


is called the cumulative distribution function (c.d.f.) or simply the distribution function of x. 
The c.d.f. F(x) has the following properties : 

(i) F(x) =f(3 20 = F(x) is non-decreasing function. 

(ii) F(—00) = 0. 


(iii) Flee) = f S(x)de =1. 


-2 
It is also sometimes denoted by F,(x). 
Any continuous differentiable function F(x) with the above properties may be regarded 
as c.d.f. of x. Then density function of x is F’(x). 
Indicater Function : Indicater function for continuous variable x is defined by | 
1 ifa<x<b 
Tia, ») (X) = c otherwise 
Ex. 9-1. If fy (.. ) denotes the p.d.f. of the random variable X and Fy (. ) be its 
cumulative distribution function, show that 
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Sol. 


_ EX) 


Consider 


q 


R.HLS. 
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ro) 0 
J O-Fr@}de~ J Fy@)ax. 
0 —o 


{l— Fy (x)} de 

{1-P(XS$x)} dx= [ P(X>x) dx 
0 

i foal a 


a) t 
J f, (t) | “| dt (Changing the order of integration) 


Steg O18 Ot +18 


J tfe(o at 
0 


0 0 
J Fr de= J PUsx) dx 


—0 —00 


0 x 0 0 
J dx | f, dt = J ‘.0] afar 


0 -—« -2 


0 
a O 


-00 
ad i ~ I. 


foe) 0 
Jt. AOatt f eAwod 
0 —2 


Jt. A (dt = EX. 


oO 


Ex. 9-2. For a continuous random variate X, show that 


Sol. 


var(X) 


J 2x {1 — F(x) + Fy (-2)} de — X?. 
0 


2x {1 - Fy (x) + Fx (-x)} dx 


2x {P (X2x) + P(X S—x)} dx 


2x i jive as dx+ | af fore a 


-2 


ones ote 8 ot 8 


CONTINUOUS DISTRIBUTIONS 


Consider 


i= i iat IxO 
vOnfh 


0 


= 


¥ fx”) dy 


x fr (x) dx 


| ll 
mio Ot 8 Ot 8 OF 8 OB © 


2 8 


= [trod 


1= J x fda) ae 
at 


oO 
= | Pf @d 
00 
J Pfea 
= =i Ve (Xx). 
Mean Moments ete. All the 


R.H.S.= 


distribution with the difference 4 
the range of the variate. 
Median. Median ‘a’ is give 
a ive) 
J fear = J, 
ae a 
Quartiles. The lower and uj 
Q 1 
J fy de =F: 


Mode. rene is that lie o 
f'@) = 0 

fx) < 0 
provided that the solution of f’(x 
Ex. 9-3. Show that for the rn 
dF = dx 
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\dx. 


‘(X > x) dx 


ianging the order of integration) 


CONTINUOUS DISTRIBUTIONS 303 


Consider 


rf 


ll 


2x i Fx) | dx 
° ¥ 
fe0r{f anh 
0 
¥ fey) dy 
x° fy (x) dx 


2x4 f frO)dypdx 
“a 
fxn aes] 
0 
0 


y fe dy = J x? fy(a)de 


—o 


0 
x fx) dxt+ | x f(x) dx 


—0 


Fig. 9.2. 


x fx (x) dx 


| 


RHS.= | f(x) de- ¥? 


= Var (X). 
Mean Moments ete. A// the quantities e.g., mean, moments are defined as for discrete 
distribution with the difference a is replaced by f(x) dx and summation by integration over 


the range of the variate. 
Median. Median ‘a’ is given by 


J moax = f fad 


-o a 


Quartiles. The lower and upper quartiles Q, and Q are given by 


eh 1 o 
J fo) dx = = J fe) dx 
Mode. Mode is that value of x for which f(x) is maximum i.e., model value x is s.t. 
f') = 0 
f") < 0 


provided that the solution of f(x) = 0 lies within the permissible range of x. 
Ex. 9-3. Show that for the rectangular population 


dF = dx Osxsl 


304 
’ 1 1 
ws, (0) = > and 2 = 75 
1 
Sol. By def. w’:(0) = J xdx=> 
0 
1 ; 1 
n2(0) = J x de=s 
0 
11 1 
= w’, (0)— {py (0)} = ---=—. 
Me = 20) tH OF = 3-F=75 
1 
Ex. 9-4. For the rectangular distribution y = oar as xa. Show that 
a 
1 ae” 
Mo (t) = oF sin h at and Won = ad 
Sol. By def. 
pes 1 
Xx IX \a 
= e* —dx=— {e"}- 
Mo (®) J 2a Dat Ia 
= 1 Cee gd 
2at at 
F 2 
Also on = f (x-x)°" — dx 
—a 
a 
her z= AM [ z—dv=0 
where om 
—-a 
a 2n 
Han = a 2 deaf x?" dx= 2 
2a a 2n+1 
-a 0 
Ex. 9-5. Calculate B, for the dist. dF = kxe™ dx,0<x < ©. 
Sol. k is given by 
fo.8) “ 
kf xe*dx = 1 
0 
° oo “ 
or {ke af + J e* al =1 
0 
ao 
kj-e*} = 1 
or { e I, 
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10) = J 
0 
w'3(0) = J 
0 
M2 = pl 
M3 = We 
= 2: 
Bi = pb 
Ex. 9-6. Find the s.d., hari 
fix) = 61 
Sol. X = 6 
w’2(0) = 6 
bey 
s.d. 


fl 
—_ 


H.M. is given by 


oe 
H 
= 6 
_! 
ene 
To find mode put f(x) = 0 
iLe., . 1-2x = 0 
] 
or x= ot 
4 
Since f’(x) =-12<0,x= 
Let a be the median. 
t ] 
Then 6 | (x —x?)dx = is 
0 a 
a a 
or —-T Ft 
2 3 ] 
or 4a°—6a° +1 = C 
a=- 
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x <a. Show that 
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— 8 


ws (0) 


w’3(0) = 


H2 = 
H3 = 
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‘ fea) 
eo 
wet dx=|-e* an +3 xe dy =6 
0 


Ota 8 CO 
x 


ioe} 
4 de=4 | x e* dx=24 
0 


W’, (0)- ¥° =6-4=2 
1’; (0) - 3’2 (0) ¥+2%? 
24 - 36716 = 4 
bs"/ba = 2. 


Ex. 9-6. Find the s. ‘- harmonic mean, the mode and the median of the dist. given by 


Ax) = 


Sol. x 


’s (0) 


Ho = TF 


s.d. = 
H.M. is given by 


6(x - x),0<Sx <1. 


1 1 
=| x(x — x) dx = 6{3-z}- my 


i ca 3 
o Bi cB ce J 
ie x") dx 6 {3 3} 10 


of 4 ra Cae x) dx= 6| (1 —x) dx 
0 


“(eds 


A 
H 
H = 
To find mode put f(x) = 
Le., 1-2x = 
or x= 


Nl CCwW|R 


1 
Since f"(x) =-12<0,x= 5 is the mode 


Let a be the median. 


a 
Then 6 | (x —x°)dx 7 


0 
; a a 
or SSS = 
2 3 
or 4a? -6a* +1 = 
a= 
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Ex. 9-7. For the dist. ‘ 
dF = ye" dx, -0<x<0 


MATHEMATICAL STATISTICS 


1 
show that yo = >? yw’ (0)=0, o= 2 and mean deviation about mean = 1. 


Sol. yo is given by 


Yo J eld, = 1 
00 
foe) 
or 2Vo f eld, = 1 
0 
wo 
2yo J e*dx = 1 
0 
1 
yo = 2 
uw’, (0) = ; J xe" dk =0 
lo = ; J veld = | x e* dx 
—0O 0 
eo 
= |-x7e "42 xe * dem? | xe 
0 0 
-x|* t -x 
= 2 |-xe ae dx? =2 
o = v2 


1 ie) 
Mean deviation about mean = 5 J lx — | el dx 


—0O 


1 foe] ao 
ott J jxje™! dx =f xe* dx =1. 

: —00 0 
Ex. 9-8. Show that for the dist. 


2 1 
Kx) = 22/ 5) z| -asxse 


(.’ &*l is an even f” of x) 


"(CC fe] =x as x 20) 


Tm \a +x 
ae 8 
be = #49 y= at(1- 5) 
ot 3n 
q 2a 1 Qa\1. 1 x|* 
Sol. Bed, dk=—|- tan —| =1 
WGA S| Sara Ic Us 
—a -a 
F 2a 
PO) 4 = 0 
—a 
: 2at > 1 4a 4 x? 4a? -a? 
= —I|x dx =— dx 
o T a 4x" T a? +x? 


CONTINUOUS DISTRIBUTION 


H4 = 


Ex. 9-9. For a continuous 
3 

fa) = = 

Find the first three moment. 


symmetrical about the mean wii 


3 
Sol. (0) = - 
0 Ww’, (0) F 
, 3 
12 (0) ee 
3 
11's (0) = 
H2 = 
Let ‘a’ be the median 
t 1 
Then f f(@)dx = = 
0 2 
cl eres eee 
or 4) x(2 — x)dx 5 
3 
or a ce 
3 3 
or a—3a°+2 = 0 
or (a= 1) =2a=2)2 
: a= 1 


Median = Mean 
Dist. is symmetrical : 
Ex. 9-10. Find the mean d 


3 

fe) = 4 

Sol. From last example, D: 
Skewness = 0 


MATHEMATICAL STATISTICS 


about mean = 1. 


(.. eis an even f” of x) 


(. bk} =x as x20) 
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Ex. 9-9. For a continuous distribution whose relative frequency density is given by 
3x(2- 
fx) = EAD osx<2 
Find the first three moments about the origin. Hence or otherwise show that the dist..is 


symmetrical about the mean with variance = = 


2 4)2 
, 3 3 2 3 x 
Sol. h' (0) = ad Pe-narn3 {2s 7 =] 
0 
2 5)2 
Lye =f 2 a3 jl xt 6 
yw’ (0) = FJ P@-nar=3 fs | =: 
0 
2 6)2 
0) = 2 x4 _3j2,5 x | _8 
w’3 (0) = 4] 8 G-aar ate | : 


6 
H2 = w2(0)- {1 (Oy eS nee 


Let ‘a’ be the median 


a 
1 
Then =f f(x) dx = = 
, 2 
or re x(2 — x) 9 
2 
or a = 3 
or a-3a°+2 = 0 
or (a— 1) (a? -2a-2)=0 
, a= 1 


Median = Mean 
Dist. is symmetrical about mean. 
Ex. 9-10. Find the mean deviation, s.d. and skewness of the dist. given by 


3 
fx) = q 12-3), 08x82. 


Sol. From last example, Dist. is symmetrical about mean 
Skewness = 0 
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Mean deviation about mean = ix — 1x (2 —x) dx 


Blu 


2 
x(1 —x) (2—x) de + =| x(x—1)(2-x) dk 
1 


& | vo 


OH Oe ON 


|W 


2 
3 
{3 — 3x? + 2x} dx + rai {-x° + 3x’ — 2x} dx 
1 


{3 ar 3 
= —4—-141}+= Ver 
414 4\|4) 8 
For s.d. see last example. . 


Ex. 9-11. Jn Ex. 9-9 calculate 1’; (0) and 1's (0) and deduce Bp. 
2 

; a 3{x° x7\"_ 16 

Sol. w(0) = =f 8 Q-xdr== {> - >} => 
4 0 4 : 


Lg = W's (0) — 41030) 1’s(0) + 6’2(0) {u’1(0)}? - 3{ns(0)}4 


dey bar oe oe 
5 5 35 
2. 
B, = Wa 2355 
py + 7 
25 


Ex, 9-12. For the continuous distribution 
dF = yox(2-x)dx, OS x82 


show that Hon+1 = 0 
for each natural number n. 
Sol. Yo is given by 
2 
1 = yo x(2 — x) dx 
: 2 
x3 
=i. eo 
0 
4 
= Yo 3 
3 
= yor | 
3 
dF = q 22-4) de, O<x2 
As in Ex. 9-9 x = 1 
s Han+1 = E{x-1y""! 


4 
Put - x-l= yp. 


CONTINUOUS DISTRIBUTIONS 


1 
3 
ar J 
1 
3 
= ra 
= 0 
Ex. 9-13. For the distributioi 
f(x) = 1- 
find mean, s.d, By and Bp. 
2 2 
Sol. J f@ax = J | 
0 0 


Put l-x = y 
; -l1 


0 
Mean = E(x 
0 
Put l-x =y 


ll 
NS 


Il 
No 
h— o——-— 


p's (0) = 


| 
|o =n 
ne 


K—6~e oe, 


" 
NO I 
™ = 
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) dx 
2 
3 
x) dx+ ra x(x—1)(2—x) dx 
1 


2 
3 
v} dx + ral {—x° + 3x? — 2x} dx 
1 


1: 3 
thre 


id deduce Bo. 

2 
~~) 218 
5 7 i 7 


3’2(0) {u’1(0)}7 — 3 {u’s(0)}4 


kk 


CONTINUOUS DISTRIBUTIONS 


Ex. 9-13. For the distrib 


f(x) = 


find mean, s.d, By and Bo. 


Sol. 


Put 


Put 


2 
J f(x) dx = 
0 


l-x = 


1 

3 n+ 

2] Plot pa-na 
-l 


1 

3 

5 f yen ‘el —y*) dy 
-1 


0 C 


ution 
1—{jl-x|,0<x<2 


2 


J a-[-x}a& 


0 
y 


1-x = 


=f 
J (ly) Cay) 


1 
2f (1-y) dy 
0 


: {2 n2(:-2)a 
E(x) 
2 


j x {1-|1—x]} de 
0 


y 


w’2 (0) = 


-l 
J G-y) -bp Cay) 
1 


1 1 
2) d-yay- | yfl-lypey 
0 -1 


{4} 


2 
J 2 (1-[l-x} d& 
0 
-1 


J a-y? a—pih Cay), 
ae 
1 


J G-2y+y7) -biay 
-1 


ie 
2f a+y~)a-yay 
0 
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. f" to be integrated is odd). 
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w’3(0) 


H3 


By 


W's (0) 


Hee om 


] 
2) (-y+y-y} dy 
0 


[od z 7 
Da Cee Gapees eee 
23 4f 6 


ws (0) - x? 
7 1 
ae 
ali, 

ire 

2 


f © (1-|1-x}} de 
0 
-1 


J a-y = pi Co) 


1 


1 
J (-3y+3y¥-y’) 1 -[y)} ay 


-] 


| 
2) (+3Y)(1-y) a 
0 


| 
2f (l-y+3y?-3y} dy 
0 


2fi-t+3-4-3.5}-3 
2 3 4 2 


w’3 (0) — 3p’2 (0) ws (0) + 2 {us (0)}? 


=—-3-—.1+2=0 
2 6 
ae 

7 = 
12 

4 

x” {1 -|1—x|} dx 


(1-y)* {1 - pi} Cay) 


1 
J a-4y+6%-4y +y4 1- pp ay 


-l 


1 

2f +6? +34 (1-y} ay 
0 

31 
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CONTINUOUS DISTRIBUTIONS 


M4 = p14 (0) 
BP ey, 
15 
H4 
ere aes 
ne, 


Ex. 9-14. Show that for the dist. 
to o and the inter-quartile range is ¢ 
Sol. The constant yo is given by 


foe) 
yo f eS dx = 1 or; 
0 


0 
aoe 
ioe) 
1’2(0) = yo J 3 
0 
= 20° 
My = 207-| 


Let Q, and Q; be the quartile 


2 1 
Then yo J ody = = and j 
0 


1 2 2 1 
or = {-se a & = ane 
a 1 
or je Ae eT ea! 
or Q, = o log, 
Q3;-Q, = Glog. 
Ex. 9-15. Prove that the geome 
dF = 6(2-. 


is given by 6 log (16G) = 19. 


Sol. log G 


tt 
nN 
—_— 
a3 


tt 
nN 
—_— 
Ne 
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Ha = 


Boe 


311 
Hs (0) — 4’ (0) 1's (0) + 61's (0) {41 (0)}? - 3 {H'1 (0)}" 
Sede esa 
15 2 5 
Ba _ 1/15 _ 12 
ps 186." 577 


Ex. 9-14. Show that for the dist. dF = yo e*° dx, 0 <x < the mean and s.d are equal 
to 6 and the inter-quartile range is 6 log, 3. 
Sol. The constant yo is given by 


foe) 


1 
yo | ew dy = 1 ory = — 
o 
0 
co co oO 
Me ie yo f xe*? dy = — f xe*'s dx=o | ye” dy 
o 
0 0 0 
tte 
where y == 
=O 
ioe) fee) : 
nw’, (0) = yo J ere dx=o | ye dy=o'2! 
} dy 5 ; 
= 20° 
M2 = 20°-o' =e" .. Sd =O 
Let Q, and Q; be the quartile 
1 9 3 
Then yo i e*F dx=— and yo J en dy = — 
4 4 
0 0 
1 _ Oj 1 
or — {-ce ve) = — and —{ oe w/o _ 3 
H's (0)}° as o 4 0 4 
s 1 = 3 
or p-e ON = 7 and 1 eee 
4 
or Q| = 6 loge 3 and Q;=6 log, 4 
OO = see 
Ex. 9-15. Prove rs the geometric mean G of the dist. 
= 6(2- EES: 1s xs<2 
is given by 6 log ae = iG 
2 
Sol. log G = 6{ logx.(2-x)(x- 1)dx 
1 
1 ~ |y|} dy 
= 6 | y(-y+ 1) log (1 +y) dy where x-1l=y 
0 


i 


Hy 


14 : 
Steere 
9 tl ae 


2 3 
y y 
ns — — |] +1 
[3 aan ) 
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I aha 
1 1 
log 2-3 | {y-t+ havea [yea 
- ytl1{ ; yt 


y | of yy 
log 2-3 er} 2 = as wy} sos 


Laer res, 
6 


6 log (16 G) = 19. 
Ex. 9-16. The elementary probability law of a continuous random variable x is 


1 
p(x) =yoe (4) 7 <x < 00, where a, b, yo are constants. Show that yo =b=— anda=m 
oO 


— © where m, © are respectively the mean and the s.d. of the dist. Show also that B; = 4 and 


Bo = 9. 
Sol. yo is given by 


ioe) 
yo f 8% de= 1 


a 


eo (x4) © 
or CO age i ote J. yo=b 
a a 
—b(x-a) 1° 
m= yo J xeP % dx = yy xe 5 ore e O-4) dy 
a a a 
ey ee eee 
= ype Letra att 
eo 
wt’, (0) = yo J x eh) dy 
a 
2 ied (oa) 
_ Xx" b(x-a) 2 -b(x-a) | 
= ——e +— | xe dx 
Yo ho ; p | 


_ 2 * (ars) 
Fee b 
z #+2(art)-(art) could 
ve Be ob By ae 
pel pale 
COD 
1 
vy = b= 
oO 
and m-O =a 


co 9) 
u's (0) = yo f Pere Ode 


CONTINUOUS DISTRIBUTIONS 


3 = 


p’4(0) = 


Bo = 
Ex. 9-17. For the dist. 


dF = 


Yo 


find the mean, s.d. and the harm 


Sol. xX = 


MATHEMATICAL STATISTICS 


oe 
1 

.dy+2[ | y?-y+1-—— |d 

ly ip y le 


3 2 1 
yy sil 1). 
a gS llog(y+1)], 


-ontinuous random variable x is 


1 
. Show that yo = b =~ anda =m 


the dist. Show also that By = 4 and 


32 
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3 = 


B; = 


p’4(0) = 


U4 = 


B2 = 
Ex. 9-17. For the dist. 


dF = 
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3 a 2 
- Heber é =f 12 BEA) py 


3 (0) — 311’ (0) p's (0) + 2 {u's (0)}? 


3 3a? ( ‘| 5 2 Lh ‘) 
a@t—taylat—j]-34a +—-lat—|[plata— 
b bP b b b b 


20 
yo f x4 e? (x-a) dx 


a 


Yo | 
w’4(0) — 4y1’3(0) 11’(0) + 6s (0) {’1(0)}? — 34u',(0)}* 
get pied, Ot w 

3 


44a a (ass) 
7 bp OP b 


4 ioe) 
_*_be-a), 4 f e-P®-9) dy 
b as 


9 

a 

H4 

“> =9 

wi 

Bun a eo Sey = dx 0<x< t 


find the mean, s.d. and the harmonic mean. 


Sol. x= 


Ly nt+t-1 m1, _ Brtlm) 
[Cr ee ee aS 
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| 2+1]m |a+m n 


n+m+l | a|m nt+m 


, es 1 n+2-1 m-1 = B(n+2,m) 
Ws (0) = Bm | x Oe aay 
pee | ote 
. jm+n+2 | m|n ~ (m+n+1)(m+n) 
op 2 n(n+]) = n 
Be RAO) Se (m+n+1)\(m+n) (m+n) 
i (m+n)? (m+n+1) 
Bh a ae SO 
(m+n) (m+n+1) 
mn 1 
Baar m+n+1 m+n 
H.M. is given by 
a om 1 n-2 m—1 _ B@-1,m) 
H Bonny | ee ae B(m,n) 


[z=i|m |men _ (m+n-1) 


ja+m—l | n|m ; n-l 
is 
m+n—-1 
Ex. 9-18. Prove that for the dist. 
1 xt! 
dP = B(m n) (ext dx, OS x<0,n>2 


variance is . Find also the mode and moment of rth order about the origin. 


(n- ij? (n—2) 


B(m,n) 4 (l4+x)"*" B(m,n) 
[ml |n=1 m+n m 
7 [m+n |m|n n-1 
ie 1 eo ymt+2-1 Poa B(m+2,n-2) 
; B(m,n) 4 (1+xy™*" B(m,n) 
m2 [2-2 m+n _ m(m+1) 


[m+n — |ral@ — (n-1)(n-2) 


CONTINUOUS DISTRIBUTIONS 


2 * hn 
(a 
eee 

(n 
HO) = Fe 
m 


Mode is that value of x for v 


f(x), = Bi 
is maximum. 
.. Mode value x is s.t. 
f(x) = 0 
Now I'(x~) = 0 


x2 + xy" 
x = 0, 
At x = 0, f(x) = 0 which is th 


Rp eve 
tx = ne A) is 


m-1 
x= is the mod 
n+l 
Ex. 9-19. Show that for the 
1 
dF == 
lr 


mean = variance = m. Find w’,| 


ae. 

x = fn 

1 

Sol. w’, (0) = h 
Ho = pW’ 

1 

w’,(0) = In 


MATHEMATICAL STATISTICS 


a ee B(n+2,m) 


B (m,n) 


ax 


n(n+1) 
-n+1)(m+n) 


Ma 
+1) 


_ Bar-1,m) 
=" Bemn) 


ea 


1 


Sx <0o,n>2 


tent of rth order about the origin. 


2 B(m+1,n-1) 
B(m,n) 


= B(m+2,n-2) 
B (m,n) 


(m+1) 


1)(n-2) 


CONTINUOUS DISTRIBUTIONS 


- m(mt+l) m? 
2 (n=Nn-2) (n 1)? 
m 
= Gol? (22) [(m + 1) (n-1)-—m(n- 2)} 
: m(m+n-1) 
(n-1)? (n-2) 
, _ te el _ B(mtr,n-r) 
’, (0) = B(m,n) ; (xy i" a B(m,n) 
? lm-+r n-r _ (mtr-l) (m+r—2)......m 
- [m in (n-1)(n—2)...(n=r) 
Mode is that value of x for which 
1 xe! 


PO. Boman) (ex 


is maximum. 
.. Mode value x is s.t. 


f(x) = 0 and f” (x) <0 


Now S'(x) = 0 gives 
x2 (1+ x)"*"" | (m1) (1 +x) -—(m +n) x} =0 
m-1 
x = 0,-1, 
n+l 


(r<n) 


At x =0, f(x) = 0 which is the least value of f(x) x =— 1 do not belong to the range of x. 


At x = ——,, f(x) is maximum. 
n+1 Ax) 


m 
= j > 
x SoA is the mode. (m > 1). 


Ex. 9-19. Show that for the gama dist. 


i 
dF = mt bce 
m 


mean = variance = m. Find yw’, (0) and harmonic mean. 


x = oa gel ters Pe ae =m. 
| m 0 m 
Sol. wr (0) = ei pimt2-1 yx eal? ae 
|m 4 |m 


..(mt+r—-1) 
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H.M. is given by 


s ‘i Ly geo ee a eee 
H [m r |m m-1 
Bs H = m-1. 
Ex. 9-20. For a continuous dist. 
2 


x 


dF = Ye 2 x"! dy, 0<Sx<00 


n+1 , 
2 
show that’, (0) = 2 —= 
n 
2 
and p’,(0) = 7. 
Sol. yo is given by 
2 
Oe 
yo | e 2 x lag&=1 
0 
2 
Put * 24 
2 


0 
i ete Lae 
or 22 yo | et. dt 
0 
n 1 ao 
im ni 
or 22 yy l|—-=1. 
: Yo ) 
1 
MON Ta 
|e 22" 
7 
Also be def. 
i fe hraie 
yw’, (0) = yo J xe iy x" de=yo fe 2 trol oy 
0 0 
x? 
Put — =t¢. 
2 
n+r-2 o nt+r 
oes 142! 
uO) = 2 2 .yo fete? at 
_ 0 
29 — 
7 Rae 1 nt+r _ 272 \ntr 
7 eae ee eee 3 
> 2 nace Pa 
2 zs 2 
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Z| 
1, (0) = | 


Ex. 9-21. A frequency f" in th 
fe) = 56 ( 


| 
o~ 


Set 
Find the mean and the s.d. of t, 


3 
Sol. Total frequency = J I¢ 


tall 
lt 
— 
& 
« 


= 1 
0 ue — een 
16: 
1 
eee 
16 
1 
1 
- al 
3 
H2 = J * 
-3 
1 
+- 
16 
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-1 


it 


1 


m—- 
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2 “> S41 
w’, (0) = — a=>77 and yp’, (0)=2. =~" =n. 
a us 
E 2 
Ex. 9-21. A frequency f" in the range ( — 3, 3) is defined by 
1 
#2) = 75 (+xy =3276=1 
1 2 
= 76 (6-2) =1Sxel 
1 2 
= 16 (3 -x) l<xs3 


Find the mean and the s.d. of the dist. 


3 -] 
1 
Sol. Total frequency = 1 f(x) dx = 6 J (3 +x) dx 


3 
elo ace — x) dx 


I 
co | 
(ibeenen os) 
“~~ 

WwW 

tad 

YS 

[<) 

w & 
+ 

—— 
F gPne ¥ 

an 

N 

ba 
~ 

> 


0 
3 
- $[-S| 6 2(6-2)-1 
8 3 , 8 3 


-l 1 
1 a. kd 
— | x(3+x)* dx+— — 2x? 
i i6 J Gey) 16 1 x6 2x2) dx 


ta | 

Il 
ew 
ad 

i 


1 3 
+ 16 J x(3-x) ax 
1 
For first integral change x to — x 


3 1 
= 1 2 1 2 
x 2 “76! x(3-x) aera x(6— 2x)? dx 


3 
! 2 
Fig = dx a 
1 


1 2 
= 16 u x(6 — 2x°) dx =0 
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3 -l 1 
1 1 
pa =| #/@aeE J OB +xy7 det = J x (6-20) de 
-1 


318 


MATHEMATICAL STATISTICS 


For first integral change x to — x 


“. s.d.= 1. 


Ho 


F(x) 


3 1 
1 1 
=f 2(3-x7 de+=—f 2 (6-2x°) dx 
8 a) 


3 I 

1 1 

a 8 (6x4 )dx+ = J (6x7 — 2x4) de 
1 


0 
3 

5 
: 2 2S gate 4 pe eens 
8| 5 8 0 
41, 
5.5 

Ex. 9-22. The dist. of a variate x in the range (0, 2) is defined by 

x 0<x<l 
(2=x)- 1<x<2 


Calculate the mean, s.d. 


Sol. Total Freq. 


1’; (0) 


w’2 (0) 


s.d. 


and the mean deviation about the mean of the above dist. 


2 I 2 
[ fodde =f Pde+] 2-2) dx 
0 0 ] 


2 I 2 
2) xf(x)dx = 2) xAdke+2] xQ2-x) dx 
0 0 1 


2 12 
+ 2-x)* dx 
I a 


2 I 2 
2) P¢@)dx = 2) Pde+2) PQ-xra& 
0 0 1 


4 2 2 

: +2 AY ag? sof x(2—-x)* dx 
4 2 
1 I 
2 

1 1 (2-x) 1 5 
Esai oe ame de | ey ele 
rar P 5 | x) 
ae 1 16 
—+— ee eee 
6 5 30 15 
16 sac 
15 15 
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Mean deviation about mean 


= 2 
{ 
1 

1¢ 

2 

10 


. 9-23. Find yo, M2, M3 ai 


dF = yo 


Sol. yo is given by 


Put 


Put 


Put 


Put 


l1+Tx=y 
fe 
(04 
2 
mi = 8 
= By 
yp? = t 
el 
= 5 
B? 
y= 
ef 
wu’, (— B) = Yo 
ao 
x+B = a 


MATHEMATICAL STATISTICS CONTINUOUS DISTRIBUTIONS 


2 
Mean deviation about mean = 2 [ lx — 1] f(x) dx 


? (6 — 2x’) dx 0 
I 2 
vl | = 2) G-xPdk+2f @-DQ-ma& 
2) (6x? — 2x") dx 0 I 
0 1 2- 
= pt 24/5 Mec Ee =i Seiad A ees tfe- x)4 dx 
1 
ee oe ee 
8 0 eee 
10 10 5 
Ex. 9-23. Find yo, U2, W3 and Wy for the dist. 
| a [5-1] -24 2 
| is defined by dF = yo (1 +r) e e ar SX < 00, 
ix<l ae , 
“x <2 Sol. yo is given by : : 
ut the mean of the above dist. ~ [$-1] ot 
e Yo f (1+ $:] ae e %dx= 
2—x) dx r ~2/a 
Put 1+7 x =y 
2 1 ssi) 
2 2 
a0 
2 
3 2 
+2 f x(2-—x) dx Put — =8 
I a 
er = By f yf? Ve PO-0 gy 
— f (2-x)* de , 
: 1 
Put yp? = ¢ 


2 ; a Pl ota 12 Yo B? 
xt+2f 2 (2-x) de p2er-1 


pe -1 
I 
2 2 
1 2p* -I 
ole x(2—x)* dx = B 
I 
2 2 5 
ii 5 a) p*-I 
a3 J ee w’-(-B) = v0 f @+By(1+2] rn 
-B 
_ _% 
pe 
Put xtp-2 
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aU es © A ee eS SS SS Se 


we Be +r 1 
~ 262 +r-1 Jy ° <2 
B 
= = zur 
87 2p" 
Jp? 1 pr +2 
, = a te = = (R2 
uw’, (- B) a p|p = B, 1) ( B) p |p? (B +1) 
2 
ren |p? +3 _ (62 +2)(8? +) 
3 (- B) 7 |P?-B° B 
2 
- 8 +4 (g2 43)? +2)(B2 +1) 
and 4 = iP 3 = : 
Ho = w- (wij? = BP +1-BP=1 
2 2 Zz 1 
Ms = w’3—3p’2pit 2M = a_a— — 3B(B’ + 1) + 28° 
2 
= B =a 
Mg = Wa 4u’s 1 + Op’s (Wi? - 340} 
(B? +3)(B? +2)(B? +1) 
as 2 


B — 4(B? +2) (B? + 1) + 6B? (B? + 1)—3B" 


6 3 
= 3+ Be =3+ 5 o. 
Ex. 9-24. (a) Show that for the dist. 


2\P 
ar = w[1-25) dx,-as xsa,0<p<1 
a 


(r-l)a? 
r+l-2p 


Mr = Hr-2 


(b) Express ‘a’ and ‘p’ in terms of 6 and Bo. 
Sol. (a) By def., 


a x2 “Pp 
100) = yo J (1-3) dx =0 


—a 
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and 


U 


Uy 
(b) Put r=2,4 


dF 


log G 


MY, 


Yo 


a @ 


B 
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‘dy : : a : 32 —p 
B , My = w’-(O)=yo fx }1-+=] & 


—-a 


l- 
, 2 Pp 
2 —yo. a J go ee dx 
- =(B°+1) ie ae 
l= a 
wa ae 2 oe pee , 
= @ Uy-2-yoa at 2 


a4 2 (1- p) 
= a Uy-2-2 Ral Te 
: (r-la? 
-2)(B? +1 tee ee 
a ) _ p(B? +1) +28" aaemer ee ae 
(b) Put r=2,4 
oe = pa a’ _ a 
ws) is Hee ge = Sas 
rl i a = 3-2p)o 
+ 2)(B°+ 1) + 6B? (B’ + 1) ~3B" 3a” 33-2p) , 
and Ha = by =——— 
5-2p 5-2p 
ae Ha _ Ba _ 33-27) 
OA Oe Bap 
9-5By 
- P 28-83)" 
sa,Q0<p<1 2 


Ex. 9-25. For B, (1 m) variate show that 
() > eg — 
log G = a { log|/ +log|m — log| lem}, 


Sol. For B, (J, m) variate x, 


1 
dF = xd —x)""! dx, OSx<1,4,m>0 
ia 
1 | 
log G = J doga.x’ h-xy""! de 


Bm) 5 
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: 
ae I-1 m-1 4 
= Bim) - a x (1-x) “| 


to} 
= Te = Beem P= a, flog B ( m)} 


a} lim 
= al in i 


me Aha 


Ex. 9-26. Show that for the dist. 


F'1-+log|m—tog|im }. 


dF = yw {t-PA a, b-a<e<b +a 
a 
2 
a 


1 ; 7 
Yo = —, mean = b and variance = 6" 
a 


Sol. yo is given by 


b-a 
Put x-b=y 
t ly 
a = 
Yo iY a ay 1 
t y 
or 2yo J -2} dy=1 
0 a 
2 a 
or . 2yo {>= =1 
0 
_l 
Yo =~ " 
b+a 
x-b 
Mean = Yo J {i-th ax 
b-a a 
t lyl 
= yo | (r+b){I- Day 2byo (1-2) dy 
-a 
= byya=b 
age af 
M2 = Yo J ay fi- =A ae=y0 fy *(1- | wy 
b-a —a 


a ; @ a 
= 2yof y (1 -*)ay- 299 
0 
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Ex. 9-27. Show that the f" 


0 
1/: 
F(x) = a\; 
is a distribution function. 
Sol. Evidently 
Foxe 
0 
1 
and F(x) .= Da 
0 
Evidently F’(x) = 0 


F(x) is a distribution functi 
Ex. 9-28. For the distribution 


fe) = ran 


Find mean, mode, median, vari. 
Sol. Distribution function is gi 


1 x 
F(x) = — 
m 
~h 
= — tan 
nT 
: 1 
For median, F(x) = 5 
x = 0 
; 1 
For first quartile, F(x) = 4 
—tan! x+ ee 
a 27 4 
or tan! x = 
4 
x =-l 
3 
For third quartile, F(x) = a 
x= 1 
1 ive) 
M = 
ean Z h 
90 
1 oe) 
Ho = Ef, J 
—o 


I 
ajN 
re-_ 
a 
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bx 


eB m)} 


<bt+a 
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Ex. 9-27. Show that the f” 


xXx<-a 

pe <x< 

F(x) = 5 a —asxsa 
Xx>a 


is a distribution function. 
Sol. Evidently 


F(x=e)=1, F(x=—0)=0 


0 x<-a 

1 

and F(x) = 3a —asx<a 
0 x>a 


Evidently F’(x) = 0 
*, F(x) is a distribution function. 
Ex. 9-28. For the distribution with density f". 


1 
fx) = — 


tT 1l+x 


3 —-o<X< oc 


Find mean, mode, median, variance, first and third quartiles and distribution function. 


Sol. Distribution function is given by 


xX 
F(x) = z rere {ran xt | 
1 2 


WS 5, 14x? 
= —tan! x+ ‘ 
1 2 
: 1 
For median, F(x) = 5 
x= 0 
: 1 
For first quartile, F(x) = a 
Es tan7! x + eee 
cs Tt 2 4 
or tan!’ x = 
4 
x =-1 
: 3 
For third quartile, F(x) = a 
Pea | 
[oe oy 
Mean = — I. 7 dx=0 
mT i+ 
1% x? ae 
=~ J z dx =— J ie 
nm , l+x tO 
2 oo. 
“Sve 
1 2 


Evidently pt. does not exist. 
For modal value x, f’ (x) = 0 


1 -2x _ 
T (+x? y 
Modal value = 0. 
Ex. 9-29. For the distribution given by 


gia b<x<0 
fo b(a+b) 
XE = 
2@-*)  gex<a 
a(a+b) 
find mean, median and variance: 
: a 
Sol. Total prob. = J f(x) dx 
= b+x)dx te 
a. | ( ue 


2 
= : fohaas 
b(a+b) “2 


a 


mean = E(x)= J xf(x) dx 


b(at+b) ea 


pee al 
~ b(at+b) | 6 | a(a+d) 


a 


w'2(0) = J fe) ax 


-b 


a ae 
7 fx (b+ x)de+— 


b(atb) *, 


2 sort 2 
~ b(a+b) |12]  a(a+b) 


b+a° = a? +b? —ab 


6(a+b) 6 


, =2 
by = B2 (0)-* 


6 


2. «sis 8 
= J x(6+x)dx+——— 


a| 


a? +b* —ab 7 (3) 
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aed x (a—x) dx 
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zal 
To find median. 
Let M be the median 
1 A 
Then Se 
aa 
0 
Now J (dr = — 
-b 
b 1 
Now < — 
Agh > 2 iff b 
0 
1 
= | fds = iff 
EN 2 
ek Mz 0 oS 
Let a>b Than 
1 
(= t= J poe 
ae 
a-b 
> = 
2(a+b) al 
= 2M 
> M=a: 


Since M is to be less than a, 


M = a- 
leta<b 
Here M<0 
1 
1 pore seks 
(1) => 2 gre 
> b(a + b) 4; 
=> pal 2 
=> M=-l 
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To find median. 
Let M be the median 


Then 


ho | 


Z 
° 
= 
—, 
Sy 
& 
= 
ll 


_ a’ +b” +ab 


18 


M 
J £0) ax. 
-b 


iff b< a. 


. M20 bga. 
Let a>b Than M >0 


1 0 
Mazes 


a-b 
2(a+b) 


= + ——_ 
a+b a(a+b) 


M 
f(x)de+ ffx) de 
0 


b M 


0 


2 
2.2 [ewe] 
a(a+b) 2 


= 2M’ — 4aM + a(a-b) =0 


> M = 


Since M is to be less than a, we take negative sign only. 


M= 
leta<b 
Here M<0 
homes 
(= 5 
> b(a+b) = 
=> = 
= M = 


[a +ab 

at ,j/———— 
2 

a(a+b) 
cr 


M 
[ G+x) de 


b(a+b) *, 


_p2 


2 
4 {rare + M 


2(M + by 


rf b(a+b) 


-b 
2 


f (a-x) dx 


| 


(1) 


...(2) 


b(a+b) 
2 
{We neglect — sign as we must have M >— b} 
If a= b since we have 


0 is the median 
Ex. 9-30. Let f(x) =ke™ (1-e), x > 0, a0 
(i) Find k such that f(x) is a density f". 
(ii) Find the corresponding cumulative distribution f". 
(iii) Find P(x > 1). 
Sol. (i) k is given by 


kf ee dx = 1 
0 


E =| 
=> =k - 
-a -2a F 
4 [i-2]-4 
a 20 2a 
=> k = 20 


be Ax) = 20e™(1-e™) © 
(ii) Cumulative distribution function is given by 


F(x) = J fx) d&x 


= 2a{ ee“ (l-e"™ )dx 


| 

0 . 

= 10  p-2ax | 
-O -2a 6 
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= 2a |- = (e%* -1) + J (ete -1)| 
Qa 2a : 


2 


= fudeo 4 20% 


(iii) P@> 1) = 1=P(<s1) 
= 1+F(1) 
= 1-{1-2e%+e"} 
= Ie % — eo, 


Ex. 9-31. The prob. density function of coded measurements of pitch diameter of threads 


of a fitting is given by 
“flx) = 


ag pe ee 
(1+ x) 
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Find the distribution functioy 

_ the quartiles of the distribution. | 
exist. 

Sol. Distribution function is 


F(x) = j 

0 
= ]_ 

and F(x) = Of 
() <. P(x>2) = 1- 


(ii) For median value x, F( 


x 1 
or a SS 
l+x 2 
x= 1 
For first quartile value x, F(x 
x 1. 
wr = tS oe 
1l+x 4 
1 
x= > 
3 


And for third quartile value x 


~ 23 

l+x 4 

i x = 3 

x CO 
Mean=| ———; : 
9 (+x)? J 
= | log 


Since as x > ©», log (1 +x) 9 

Ex. 9-32. A bombing plane c 
Ifa bomb falls within 40 feet of tr 
traffic. With a certain bomb-sigh 
function 


10¢ 


F(x) 
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mf". 


: (ere -1)| 


trements of pitch diameter of threads 
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Find the distribution function of the dist. Hence obtain (i) P(x > 2), (ii) the median and 


exist. ; 
Sol. Distribution function is given by 


| i 
F(x) = ax =|- 
(x) J (+x) fee 7 
1 
= 1-——=— forx20 
l+x +x 
and F(x) = 0 for x<0 
(i). P(x>2) = 1-P(x$2)=1-FQ)=1- 
. 1 
(ii) For median value x, F(x) = > 
Ss sco 
ae ee a 
De | 
. 1 
For first quartile value x, F(x) = a 
ete od 
‘ l+x 4 
1 
x=> 
3 
: F 3 
And for third quartile value x, F(x) = = 
ee ee 
l+x 4 
7 x = 3 
ao ; feo) oa} 
1 d. 
Mean =f Ts d= J dx — { 5 
9 (+x) 9 itx o (l+x) 


| log(1+x)|5 -1 


. the quartiles of the distribution. Investigate whether the mean and the variance of the dist. 


Since as x > ¢, log (1 + x) > 9, mean does not exist. Hence variance will also not exit. 
Ex. 9-32. A bombing plane carrying three bombs flies directly above a railroad track. 
[fa bomb falls within 40 feet of track, the track will be sufficiently damaged to disrupt the 
traffic. With a certain bomb-sight the points of impact of a bomb have the prob. density 


function 
100+x 
f@) = “x4 when ~- 100 $x <0 
100 -x 
a 104 whenQ<x<100 


=? elsewhere 
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where x represents the vertical deviation from the aiming point, which is the track in this 
case. Find the distribution function. If all the three bombs are used, what is the prob. that 
the track will be damaged ? 

Sol. Let F(x) be distribution function. 


Then F(x) = ieee ie —~ dx, if-100¢x<0 
-100 -100 
2 te 2 4 
eee er 
10 ‘is 104 
2 100+x , * 100-x | 
Fa) =f —|] f —. ifosx<100 
-100 10 
27 27 
=a 0s + 9 | fio =| 
100 0 
2 4 
= ic0eee a 
. 2 2 
F(x) =0 if x<-—100 
and F(x) =1 if x > 100 
Now prob. for a bomb to fall within 40 feet of the track 
ee f Sfx) dx 
-40 
f 100 +x | 100-x 
“ 4 4 
ty, 10 10 
2 29g 16 
= —,|100x-~-| = {4000-800} = — 
10 2 Jo 10 25 
’ Prob. for a bomb not to fall within 40 feet of the track 
wl - AG 9 
"95: 25 


’. Prob. for all the three bombs not to fall within 40 feet of the track 


9 3 

~ (35) 

. Prob. of at least one bomb falling within 40 feet of the track - 
3 
24 (=) 
25 
*. Prob. of the track being damaged = Prob. of at least one bomb falling within 40 feet 
of the track 
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Ex. 9-33. Suppose the life in 
function 

100 

Ax) = Tr 

= 0 

Find the prob. that none of th 
replaced during the first 150 hour 
original tubes will have been repla 
Sol. Let x hours be the life of 1 

150 
Then P{x<150} = f fl 


*, By compound prob. theorer 
during the first 150 hours of operati 


Also P(x > 150) = 


*. Prob. that none of the three 
operation 


Ex. 9-34. Assuming F(x) to bec 
x and given that F(x,;)=0-5, F(x2) = 
lie between x, and x2; x2 and x3. Ca 
and X, will lie between — to x, ai 


— 09 10 °°, 


Sol. Let f(x) be the density fun 


Then P{x;$xSx2) = me 


Pi{x. Sx 8x3) = J f( 


= F(x) 

Now P{—<X)< x} = Fa 
P{x, SX) <0} = 1-F 

= 1-F 
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r point, which is the track in this 
s are used, what is the prob. that 


dx, if -100 <x <0 


ream ak 
gee 


~ if0<x<100 


1 rae 
Nee = 
04 2 |, 


ack 


yp a 


16 
— ~800} = — 
a {4000 } 95 


track 


) feet of the track 
of the track 


ast one bomb falling within 40 feet 
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Ex. 9-33. Suppose the life in hours of a certain kind of ratio tube. has the density 
function 


100 
Kn) = 7 when x = 100 
x 
= 0 otherwise 


Find the prob. that none of the three such tubes in a given radioset will have to be 
replaced during the first 150 hours of operation? What is the prob. that all three of the 
original tubes will have been replaced during the first 150 hours ? 

Sol. Let x hours be the life of the tube. 


150 150 1 1 
Then P{x<150} = | fix)de=100 J = dxe== 
0 100 * 2 


.. By compound prob. theorem, prob. that all the three tubes will have to be replaced 
during the first 150 hours of operation. 


2 
Also P(x > 150) = (eee 
- 3°33 
’. Prob. that none of the three tubes will have to be replaced during first 150 hours of 
operation 


7 (2)-2 
3 27° 


Ex. 9-34. Assuming F(x) to be cumulative probability distribution function for a variable 
x and given that F(x,) = 0-5, F(x2) = 0-7 and F(x3) = 0-8, find the prob. that the variable will 
lie between x, and x2; X2 and x3. Calculate the prob. that two independent observations X;, 
and X will lie between — © to x; and x3 to o. It may be assumed that x takes values from 
— 00 10 0, ‘ 


Sol. Let f(x) be the density function. 


x2 
Then P{x,;<x<x2) = J fax 


—-oO x2 

= f fade + f fdax 
nc 

= f f(dc- | fax 


= F(x) — F(x) =0-7 —0-5 = 0-2 


: x3 x3 *2 
Ply <xsay) = J fledx= J f@yde- J fax 
oe 


= F(x3)— F(2) = 0-8 -0-7=0-1 
Now P{-— eo <X) < x;} = F(x) =0-5 
P{x3 3X) <0} = 1- Fl 0 <X,< x3} 
= 1-F(a3)=1-0-8=0-2 


or 


or 


or 


or 
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Since X, and X2 are independent observations, reqd. prob. 
= (0-5) (0-2) =0-1. 
Ex. 9-35. A distribution function is defined as follows 


0 xsl - 
1 4 

F(x) = ion l<x<3 
1 x>3 


Find the density function f(x). Find the mean of x and the median. 
Sol. Density function is given by 


1 
fia) = F@)=7 1 1<x<3 
= 0 otherwise 


3 
1 1 
Mean = ad x(x) dx = 7 


4 
at 


zs 3-—-|(-1)° [3-5 = 2-6 


1 
" or median F(x) = 3 
1 : 1 
ooo 3 

1 

a x = 1+(8)4. 

Ex. 9-36. Determine m so that the following function represents the density function 
0 x<-l 
m(x+1) -1<xs3 

$= Vm 3<x<4 
0 x>4 


Find the value of x about which the mean deviation of this dist is least. 
Sol. m is given by 


3 40 
i m(x-+1)dx + [ 4mdx =1 
-1 3 

3 
(x41) 


m + 4m|x|3 = 1 


-1 

8m+4m = 1 

2 

12 

Since the mean deviation is least about median, it is required to find median. 
Let it be ‘a’ 


m= 
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a 


1 
Then f Add = x. 
= 2 
Let if possible a > 3 
; 3 
Then m f ( 
-l 
1 
or 8m + 4m (a-—3) = 3 
1 1 
) —(a-3) = —- 
; gre 


which is not possible 


a#3 ite, a<3 
a (a+1)? = 12 


on a = 22 

Ex. 9-37. A country filling st 
volume x of sales in thousands of, 
must be the capacity of its tank in. 
given week shall be 0-01? 

Sol. Let V be the volume of c 

Then prob. that the supply wi 


= P(x 
P(x2V) = 0-01 
P(x<V) = 0-95 
V 
5{ (I-x)*dx = 0-95 
0 
or 1-(1-V) = 0-96 
or (1-V)> = 0-01 
= V = 0-6( 


Tank capacity = 602 

Ex. 9-38. For continuous vari 
from the median. . 

Sol. Let x be a continuous va 

Now by def., mean deviation 


20 


F(a)=E-a\ = | 


: —00 
Differentiating wrt. ‘a’ unde: 


a 


F(a) = J. 


oO 


and F(a) = fla) 
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prob. 


ws 


id the median. 


on represents the density function 


1 of this dist is least. 


is required to find median. 
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Th { Aadx = 
en u x 5° 
Let if possible a > 3 
: 2 a . 
1 
Then m J (x+1)dx+4m | dx = — 
2 
-} 3 : 
1 
or 8m + 4m (a-3) = 5 
ee ae | ae ee 
—(a-3) = —-8m =—-—=-=— 
- 34-9) = 5 ae Cane 


which is not possible 


F 1 
+)dx = — 
m | (x )dx 5 


-1 


a#3 ie, a<3 
=> (a+1yP = 12 


he a= 2/3-=1. 

Ex. 9-37. A country filling station is supplied with gasoline once a week. If its weekly 
volume x of sales in thousands of gallons is distributed by f(x) = 5(1 — x)‘, 0 <x < 1, what 
must be the capacity of its tank in order that the prob. that its supply will be exhausted in a 
given week shall be 0-01? 

Sol. Let V be the volume of capacity of the tank in thousands of gallons. 

Then prob. that the supply will be exhausted in a given week 


= PXx2V) 
P(x2V) =.0-01 
P(x <V) = 0:99 
V 
5{ (-x)*dx = 0-99 
0 . 
or 1-(1-V) = 0-99 
or (i-V)y = 0-01 
V = 0-602 


Tank capacity = 602 gallons. 
Ex. 9-38. For continuous variable, show that the mean deviation is least when measured 
from the median. 
Sol. Let x be a continuous variable with density function /(x). 
Now by def., mean deviation about an arbitrary point ‘a’ is given by 


foe) 
F(a)=Ele-al = J be-alf(x) dx 
. a foe) 
= J (a — x) f(x) dx + f (x — a) f(x) dx 
Differentiating wrt. ‘a’ under the sign of integration 


F(a) = J fxdx- J fede 


oO a 


and | F(a) = fla)+fla) =2fa) 
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For F(a) to be minimum, ‘a’ is given by 


F(a) = Oie, | f(x)de=J fx) dx 


—o0 
which implies that ‘a’ is the median. 


If fla)#0. F”(a) >0 (. fla) > 0) 
; F(a) is minimum when ‘a’ is median. 
If fla)=9. (x) is minimum for x = a C. ft) € 0). 


First derivative of f(x) which is not zero for x = a is of even order and is positive. 
.. First derivative of F(a) which is not zero when ‘a’ is the median of even order and is 
positive. 
F(a) is minimum when a is median. 
_ 9.2. Tehebycheff’s Inequality 
Let x be a continuous variable with density function f(x) and expected value zero. The 
variance of x is given by , 


a “Gy! Ox! 00 
c= f Pfxddr= f PAdct+ J rtodet+ J efya 
0 0 ; -Ox.t Ox.t 
ox.t 
Now | J fx) dx20 
~Ox.t 
—Oyt 00 
ot > f x fader J ef) dx 
—00 Ox. t 
—Gy ft 00 
> 050?) J fxde+ J fda 
= Sx.! 


= 6, PP {el 2 ost} 
7 =P {hl 2 050 


Put x = y-y 
LS Sa =\2 _ 2 
Then E(x) = 0 and of = E(y-y)" = oy 


1 - 
a 2 Pily-y| 2oy.t} 
which is Tchebycheff’s inequality. 


9.3. State and prove weak law of large numbers 
Let x], X2, ..... X, be n independent random variables distributed in the same form with 


mean m and x = 


Then for any fixed € > 0, 
Lt  P{|x-m|>s} = 0 
no = 
Assume that the variances of x1, X2, ......- X, exist and are equal to o 
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Then Var (x) = 


ty 
ew 


i 
ae 
a2 
oe 


| 
oy 


Let ts > e 
va 


t> 


oO 
S—— P 
nea {| 


which implies that for given € > 0,: 


Lt P{|x- 


n->o0 

Note. (1) The above law rer 

(2) The weak law of large nun 

In fact, given any positive numbe 
Pi\|x—m|ze}<sd, nz 

The weak law states that |x - 
might be that for some n it was lar 
strong law says that the prob. of 
variables which are identically d 
cases further conditions must be : 
Ex. 9-39. Define stochastic 
series of Bernoullian trials, the 
prob. of success in each trial as 1 
Sol. Def. A variate x, is sa! 
parameter 6 if given any positive 
Pix, — [> €} <8 for 

Let x), X9,.... X, be the variati 


xj = 11 

= 0i 

Then number of successes is 
m= x) 

and Ej) = 1. 
Var (xj) = p- 
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“" flx) € 0). 
f even order and is positive. 
s the median of even order and is 


(x) and expected value zero. The 


x) dx+ f x fix) dx 


Oy.t 


x) dx 


f(x) dx 


Ny 


distributed in the same form with 


are equal to o 
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2 
Xp tXo +....¥ 
pater z H - m} 
n 


= <> E {(er—m) + (erm) + cesta + (x,-m)] 


Then Var (X) =. 


2 


f . 
aor) E{(xy — m)? + E(x, -- my? + 0... + EX — my} 


.. From Tchebycheff’s inequality 


= Plis-miz | 
> € 


vne 


fe} 


nw — 
IV 


Let he 
e roe 
vn 


o2 


—z > Pilx-ml>e} 
ne~ ; 
which implies that for given € > 0, the prob. can be made as small as we please by increasing 7. 
, Lt P{|x-m|>e} =0 
nao 
Note. (1) The above law remains true even if we discard the requirement that O° exists. 
(2) The weak law of large numbers states a limiting property of sums of random variables. 
In fact, given any positive numbers € and 6 there is an n s.t. 


P{|x—m|ze}<5, n2N 
The weak law states that |x —m| is ultimately small but not that every value is small; it 
might be that for some it was large, although such cases could only occur infrequently. The 
strong law says that the prob. of such happening is extremely small. The law is true for 
variables which are identically distributed under the sole condition that 1 exists; in other 
cases further conditions must be added. 


Ex. 9-39. Define stochastic convergence of the variate and show that in an infinite 
series of Bernoullian trials, the proportion of successes converges stochastically to the 
prob. of success in each trial as the number of trials increases indefinitely. 


Sol. Def. A variate x, is said to. converge stochastically (or in probability sense) to 
parameter @ if given any positive numbers € and 6 there is Ns.t. 


P{\x, -0|2e} <8 forn>N 
Let x;, X2,.... X, be the variates 
x; = 1 if ith trial results in success 
0 if ith trial results in failure 


Then number of successes is given by 
m= XytxXet TX, 
and E(x) = 1.p+0(1-p)=p, EQ@’)=p 
Var (x;) = P-P=Pq 
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where p is the prob. of success in each trial 


or 


E(m) = np and Var (m) = ngp 


by 
lOO 
=|3 
eee 
| 


Dp 
vu u Pq 
Also Var|—| = —> Var(m) = — 
n n n 
From Tchebycheff’s inequality 
p{|E-» ajee he 
n t 
Let 2 =e 
n 
p\|2-elze} < 2 
n n& 
P| 7 p 2 0| < 6 
n 
When n> LA. 
6°35 


Then N = integer > AS 
— 88 


EXERCISES 
. Find the variance of the distribution. 
1 ; 16 
dF = — xsinx O<x<n Ans. 2 - — 
us T 


. Ifftx%) = be ™, 0 <x <co where bisa positive constant. Find mean, 2 and p13. 


1 1 21° 
Ane 
* be 3 | 


. Find pg, 3 and py for the distribution 


a ; 2 4 
he -asx<a Ans. colli ee 
2a 3 5 
. For the distribution 
e* 
dF = x" —- m20,0Sx<© 
m! 


show that H.M. is m. 


. Find mean, mode and median of the distrinuuon 


dF = sinx dx OSxS> [ Ans. 15,2 
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6. Find the mode and the medi 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


y = 


. Find the moment generating 


Ax) = 


and deduce the mean and v 


. Show that ifx is a random v 


Pla<x<b} = 
then a $ E(x) § b and var (x 


. Find the mean deviation fro 


dF = 
and show that it is minimun 


x is arandom variable with : 
‘a’ is the median. 


Does there exist a random \ 
P 
Sol. By Tchebycheff’s inec 
P 
Put r= 
=> P&-20,<Sxs x +: 


ie, 0-6 20-75 whichi 

there does not exist 
If.X is a random variate wit 
and. P(X <0)=0, show 


1 
P(X > 21) $5 


For the distribution given t 
Ax) = 


find the mean and variance 


Let x be a random variab! 
minimized when b = p. 
For the continuous distribu 

fix) = 
find E(x) and var(x). 


For a random variable witt 


F(x) = 
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6. Find the mode and the median of the curve 


7 
8 
9 
10. 
11. 
16: 
Ans, 2 - — 
| , =| 
Find mean, [ty and LW. 
27° 12. 
| Ans bari Be” =| 
2 4 
Ans, Lith 0, aa 13. 
3 5 
14, 
15. 
ee | Ans. 4 S52 
2 3 
16. 


abx™! 
y= (14 bx)? b>0, a>1, OS x< 


. Find the moment generating f” of the distribution 


Lo 
fx) = 5 ee" —o<x<0o 


and deduce the mean and variance. 


. Show that ifx is a random variable such that 


Plasx<b} = 1 
then a $ E(x) < b and var (x) < (6- a)’. 


. Find the mean deviation from ‘a’ of the distribution 


dF = e*dx, x>0 
and show that it is minimum when ‘a’ equals the median of the distribution. 
x is a random variable with a probability density. Show that E|x — a| is minimum when 
‘a’ is the median. 
Does there exist a@ random variable x for which 
P&X-26, Sx Sx +20,} =06 
Sol. By Tchebycheff’s inequality 


P {\x-3<ro,}21-— 
; 
Put r=2 
_ = 1 3 
=> PR 2p Se Seale pelo OI 


0-6 20-75 which is not possible. 

a there does not exist any random variable with given prob. 
If.X is a random variate with E(X) = 

and. P(X <0) =0, show that 


Lé., 


1 
P(X > 2) <5 


For the distribution given by 
Ax) = [1-2] 10,2 @) 


. 1 
find the mean and variance. | Ans mean=1, variance= 5 


Let x be a random variable with mean p and variance o”. Show that E(x — by is 
minimized when b = p. 
For the continuous distribution with p.d.f 
fx) = % oe To, ) 11 
find E(x) and var(x). ans (; ; )| 
For a random variable with c.d.f 
| FQ) = (1-pe™) lo.) 
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find mean and variance. 
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[Ans P mem 
Cae 


17. Ifx is a random variate for which 


P(x <0) = 0 


show that 


and FE (x) ="U<o, 


Pix<ut] = 1- ., for every t= 1 


Sol. Assume x is a continuous variate with p.d.f f(.). 
ie.6) 


L = E(x) 
> P(x > ut) 
=> 1-— P(x < pt) 

P(x Sut) 


IV 


IV 


lA 


IA 


IV 


J xf(x)dx 


—- oO 


J xf@ae+ fo xf(xyde 


x2pt XSpt 


J x fix) dx 


x>pt 


J ut f(x) dx = ut Pix > ut] 


x2yut 


E 
t 
J 
t 


1 
l7. 
t 
OO) 


Theor 


10.1. Binomial Distribution (B 
Binomial Probability Dist 
P(x): 

The variate x is called Bim 


the distribution. 


Binomial Frequency Distr 
F(x): 
where N is the total frequency. 
Deri-ation . 
Let there be N sets of n ind 
trial be p and of failure is g. The 


Let-us first calculate the chz 
Let us find the probability of ot 
theorem of compound probabili 


and the probability that the rem: 


.. The probability of joint 
trials failures 


Clearly this is also the prot 
any particular definite specified | 


and x trials can be chosen out of. 
probability, the probability of x 


The chance of getting x suc 


one set "c, p*q"™* sets will hay 
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(x) dx 


> pt] 


Ans. 
[amt 


Pp ee 52 
> 2 


OO) 


Theoretical Distribution 


10.1. Binomial Distribution (B.D.) 
Binomial Probability Distribution. The B. PD. of the variate x is 
POR Sop go eS ONG Qycssccases n 
The variate x is called Binomial Variate (B.-) and n and p are called parameters of 
the distribution. 
Binomial Frequency Distribution. The B.F-D. of the variate x is 
FID AN "eop g” * x=), Qian 
where N is the total frequency. 
Derivation 
Let there be N sets of n independent trials. Assume that the chance of success of each 
trial be p and of failure is g. Then 


ptqe=l. 
Let us first calculate the chances of obtaining 0, 1, 2,.....successes in one set of 7 trials. 
Let us find the probability of obtaining x successes and (— x) failures in n trials. By the 
theorem of compound probability, the probability that first x trials are successes 
. = px px p..x times = p” 
and the probability that the remaining (1 — x) trials are failures 


x % 


‘n= 
=4q 
.. The probability of jointly getting first x trials success and the remaining (n— x) 
trials failures 


Clearly this is also the probability for the x successes and (n— x) failures to occur in 
any particular definite specified order. Since we are interested in any x trials being successes 


and x trials can be chosen out of min "c, (mutually exclusive) ways, by the theorem of total 
probability, the probability of x successes in a series of x trials is given by 
P(x) —_" c,p'q” * 
The chance of getting x successes in one set of v trials is "c,p*q” * means that out of 


x 


one set "c, p*q"~* sets will have x successes. 
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& Out of N sets N "c,.p*g”™ sete will have x successes. 

.. The frequencies of getting 0, 1, 2,...successes in N sets of n trials each are 
Ng", N"c,pq"?, N "cyp7qr?.....N "Cap": 
10.1.1. First Four Moments About Mean | 

Binomial Probability distribution is 


P(x) = "co p*qn*,x=0, lich 
n n 
wi(0) = DsPQ) =>. x"e,p"q"™ 
x=0 x=0 
=n n-1 n 2 n-2 i i 
“Pa +2 Cop q ~t...+n ¢,p 
= np {q"! +(n-1) pq”? +...4p""} 
= np (q+ py 
= np 
us n 
30) = DP) = Dx 4+x1 "epg" 


x=0 x=0 
no n 
Z Sox "eypiqh t+ x ne ptqh 
x=0 x=0 
= 21 "cop*q”* +3.2"cgp°q"’+...4n(n—Lp" +np 
= n(n-1) p* (q”? +(n—2) pq? tectp? )+np 
= n(n-1)p? (q+p)"? +np 
= n(n—-1) p? +np 
He = 12(0)—{u{(O}? =n(—1)p? +np —n? p? 
= np (1— p) = npq 


n 


SY) P(x) 


13(0) = 
x=0 
Put xo = Ax+ Bx(x—-1)+ Cx(x-1) (x—-2) 
Equating co-efficients of 3, x? and x 
| é=i 
B-3C=0 => B=3 
and A-B+2C=0 > A=l 


x? = x4+3x(x-1) + x(x—-1)(x—-2) 
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, 


Ha 


xt = x(x-1)(x-2)(x-3) 


2 nyo) = ee 
; x=0 


= n(n-1)(n-2)(n-3)p* + 
bg = 1040) — 45 Oni ( 
= {n(n-1)(n-2)(n-3)p4 - 
~A(n(n-1)(n -2)p? + 3n(n 
=(n* —6n? +11n? - 6n)p* 
-(n* - 3n3 +2n? )p* -120 
= 3n” p* - 6np* -6n’ p* + 
= np{(1-7p+12p* - 6p) 
= np{(1— p)(1—- 6p + 6p”) 


= npq {1—6p(1- p) + 3npq 


= npq {1+ 3pq(n-2)} 
Ex. 10-1. For binomial dist 
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i€S. 


in N sets of n trials each are 


‘x-1)+x] "c,p*q"* 


n 
n-x + ox "c p*qr* 
x=0 


"e,pq” > +..4n(n-1)p" +np 
n-2) pq” >+...4p""") +np 


=a +np 


n(n -1)p’ +np -n*p? 


(x-1)(x-2) 
3=3 

A=1 

x(x -1)(x-2) 
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u3(0) 


Hs 


14,(0) 


Set Bx(e= 1) + xe -Dr-2)} "nyp*q”™ 


x=0 


np + 3n(n—- lp? +n(n—-1)\(n-2)p 


(0) — 3415, (0) 11 (0) +2 {u(0)}? 


np + 3n(n—- 1)p? trntn-Din- 2)p? 


—3{n(n - lp? +np\np + 2n} p? 


np+3n p- - 3np” + np? - 3n* p? + 2np? 
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-3n' p? +3n? p? - 3n? p? +2n>p?* 


np (1- 3p+2p’) 
np(1— p)(-2p) 
npq(q- p) 


n 
x* P(x) 
x=0 


xt = x(x -L(x-2)(x- 3) + 6x(x - I(x - 2) + 7x(x -1I) + x 


n 


x=0 


. w,(0) = SY) x(x = (x - 2)(x- 3) + 6x(x- 1)(x-2) +7x(x-1)+x}." c,p*q" 


= n(n—1)\(n-2)(n-3)p* +.6n(n-1)(n-2)p* + 7n(n—-l)p* +np 
“pg = 104 (0) — 45 (0); (0) + 6415 (0) {uj (0)}? - 3} (0) }* 
= {n(n-1)(n—2)(n—3)p* + 6n(n-1)(n-2)p? + Tn(n- 1p? + np} 


-A{n(n -lDa- 2)p° +3n(n- l)p’ +np\np +6{n(n— l)p” +np\n* p” = 3n* p* 


= (nt —6n? +11n? -6n)p* +6(n7 —3n? +2n)p* +7(n? —n)p +np 


~(n* —3n3 +2n* )p* -12(n3 -n*)p? -4n? p? +6(n* ~n°*)p* +6n>p° -3n' p* 


= 3n* p* - 6np* oa 6n’ p? + 12np? + 3n? p* - Tnp* +np 


=np{(l-7p+12p" —6p*)+3np(1-2p+p*)} 


= np{(1- p)(1- 6p + 6p”) +3np(l - p)”} 


= npq {1-6p(1— p) + 3npq} 
= npq {1+3pq(n-2)} 


Ex. 10-1. For binomial distribution show that 


du, 
Mey = Pala fe 
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and deduce the values of UW, U3 and 4. 


Sol. Binomial distribution is 


P(x) = 
By df. L, = 
du, 
dp 
+p* (n—x)q? 
n 
= Lop gs 


x=0 


‘dk n r+l 


= — Hyyp ~My} 
Pq 


Bra = 


Put r= 


Ha = 


"épg? * £01 Zach 
E(x —npy” 


n 


Sepia *(x=npy 
x=0 


n 


>» Me {xp*g""*(x—npy 
x=0 


d : - ra 
po) + p*q"*r(x—np)y\(-n)} 


(x np)’ {xq - p(n-x)} 


c,p'q” *(x—np)'* —nrp,., 


1, 2 and 3 


( Ho = 1, y= 0) 


2n 4 ibe =n, uy } 
Pq) enyy do ea ap Pq 


=npq (q - Pp) 


d d 
Pq {on = a =npq > +— {pq(q - Pp 


dp 
npq(3npq +(q- p)” ~2pq] 


npql(q+ py +3pq(n-2)] 
npg [1+ 3pq(n —2)). 
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Ex. 10-2. For a binomial varia; 
Head 


where \!. = E(x’) and r is a non-ne 
Sol. By def. 


' 


My 


ay, 
dp 


, 
Herat 


10.1.2. Measures of Skewness 
Sol. Measures of skewness 


Measure of kurtosis 
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Ex. 10-2. For a binomial variate x with parameters n and p show that 


di, 
Hear = _ np, + pq——_ dp ' 
2,0 
where 1) = E(x’) and r is a non-negative integer. 
Sol. By def. 
= E x” 
at HM; (x") 
n 
= ye eophgh 
x=0 
c—np)" 
du}. ~ 
| Pa "eo [xp tg" -(n- xq’ p*] 
| x=0 
n)} 
gy 
dp 
n 
_ pce "ec p*'q -1 n-x-1 [xq - (n- x)p] 
: 49 =-] x=0 
dp — 
1 n 
= — Six" "c, p*g Eq * (p= np) 
Pq x=0 
1 n n 
ean Ds "co pig’? ~npy x’ " c,p*q"* 
Pq x=0 x=0 
1 t Ll 
Stora {brat —npu;} 
Pq 
' dy, 
Hat = MP be + pq. 
dp 
Pq (Ly = 1, a, = 0) 10.1.2. Measures of Skewness and Kurtosis 
Sol. Measures of skewness 
d 2 
npg— = F3 
ae {pq} = 71, = vB) = 3 
1pq(q- p) 
Pq — P Leslie 
d 3? 
npq | ano +p (Pag P) ] : 
_ "Pq(G—P)_4-P. 
—2pq] (npqy”” V"Pq 
n—2)] 25 4 Bs 
- Measure of kurtosis = B, == 


Y2 = B.-3= 
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a(npalt Spat =2)) 
(npq)” 


_ 1+3pq(n-2) -, , 1-6p4 


npg "pq 


1-6pq 
npq 


10.1.3. Mean Deviation about Mean 
Mean deviation about mean is given by 


Now pt, =0 


n = Elx—np| 


Hl 


n 
Sxl eyptat 
x=0 


n 
+ DG -np) "ey p*q”* =0 


x=0 


= Yi(-np) "¢.p q" 


x>np 


x>np 


= 2a Sen —t,} 


x>np 


where 


: ni 
es ces $ 


= > (np-x) "cy p*q’* 


x<np 


n = 2) (x-np)"c, p*q"™ 


x>np 


= 2) (xg-(n—x)p) "ex p*q”™ 


x>np 


= 2 > (x "e,p%q" —(n-x) "exp 


x>np 


! = 
n-x+ = Liars. pig: ‘| 


xi(n—x-D)! 


n!} ae n-x 
tx © xl(n—x-1)! 


Let } = greatest integer contained in np +1 


Then 


n= 2c rat 
x=p 


3 > (x-np) a es Ye - x) "c,p*q"* 


x>np x<np 
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10.1.4. Mode of the Binomi 

In binomial distribution the | 

P(x) 

The values of x together with 

' The mode is that value of x for wh 
Leé., 

P(x-1)S P(x 

Consider P(x-1) $ P(x) 


oe "oe peg" 
n!} 
on (x-Di(n-x+ 
or xq S(n+1) p-.: 
or xqt+p)s(n4+1) 
or xS(n+Dp 
Similarly other inequality giv 
x2(n+1)p—1 


From (i) and (ii), modal value 
(n+l)p-I<x< 


Casel: If (n+1)p=k isan 


P(x=k) 


ee Pix=k-1 


 PXxak 
. Inthis case P(x) increases t 
decrease. 


“.x=k and x=k-1 are two 
Case II: If (n+l) p=k is no 
(n+1) p=a 
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pig! + Dinp—x) "exp'a"™ 


x<np 


-x)p)"c, p*q”* 


x+l on-x 


qh! —(n- x)" CyP 4 } 


x41 n-x 
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Z 2.n! 
7 ey 


Spd er 


pig?" 


= 2npq {" 
10.1.4. Mode of the Binomial Distribution 
In binomial distribution the probability of x successes is given by 
P(x)=" c, p*q”*, x=0,1,2,...0 

The values of x together with the corresponding probabilities form binomial distribution. 
The mode is that value of x for which P(x) is greater than or equal to P(x—1) and P(x +1) 
Le., 

P(x-1)< P(x) > P(x+h) 


Consider P(x-1)¥¢ P(x) 
He ec, peigt ts” c, p*q™ 
, n! n!} 
ty @—DMn—x4+Dl? ala? 
or xq <(n+1) p—xp 
or x(q+p)s(n+l)p 
or xS(n+l)p ..- (2) 
Similarly other inequality gives 
x2(n+l)p-1 .. (ii) 
From (i) and (ii), modal value x satisfies the inequality 
(n+l)p-lsx<(n+l1)p ...(iii) 
Case I: If (n+1)p=k is an integer, then (n+1)p—1=k-1 is also an integer. 
P(x =k) "cy pig” 
Now P(x=k-l) "oy, phigh 


n! (K-D!(n-k +1)! p 
~ kMn-k)! n! q 
_ (t+) p—kp _kd~p)_ 
kq kq 
C P(x=k) = P(x=k-1) ...(iv) 
. In this case P(x) increases till x =k —1 and then (iv) holds and after that it begins to 
decrease. 

“.x=k and x=k-1 are two modes. 
Case II : If (n +1) p =k is not an integer, let 


(n+1) p=a (an integer) + f(a fraction) 
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when x takes the vale ‘a’ (which is obviously less than k and greater than k-1) from 
(i) and (ii) . 
P(a-1) < P(a) > P(a+1) 
ae x =a (greatest integer less than k) is the mode. 
Ex. 10-3. /fnp be a whole number, the mean of the binomial distribution coincides with 


the greatest term. 


Sol. If np is a whole number, then since p is a fraction, np is the greatest integer less 


than np + p=k. 
-, From Case II, mode = np = mean. 
Ex. 10-3. (a), [fx is the unique mode of the B.D., show that 


(ntl) p—-l<x<(n+l)p. 


10.1.5. Moment Generating Function 
M.GF.,, by def. is given by 


n 


M,(t) a Ete") os boa "o p*q"* 
; 0 


x= 
n 

a> eeey a Sarre) 
=0 


M.G.E. about the mean ‘np’ is given by 
M,(t) = E(e™) =e" M(t) 


e "(q+ pe')” 
= {qe + pe"}". 
Deduction of moments about mean 
M,(t) = (qe™ + pe”)” 


st, ts oad gt GP at : 
= 3q\l-pt+P a? at? rate +p\ l+qt+q ae ae Pt 


a 22 ho Sack ‘ 
= 1+ pa + Pa —p Jee we? +4q orien 


? 2B ‘ ‘ 14 ‘ 
= 1+ pas, + PATH Pz + PUP —patan)ai te 
2 3 4 
3 L4pyttbe ate ae aa 
SoCRE Nees erat ea 


2 3 4 
t t t 
= l+npq5, +npq (q-P) 3 + [npq {p" —pqrt q°} +3n(n—1)p'q? 1 +- 


yt, = 0,2 =npq, 3 = "Pq (q— P) 
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‘ 


and 


Ex. 10-4. [s the sum of two i 
what are the conditions under w. 


Sol. Let x, and x, be two: 


Ny, Pz Tespectively. 


Then M,(t) of 
and M,(t) of 

Let 

Then M,(t)o 


which being not of the form (q - 
If p, = p. = pso that g; = 
Then M,(t) of x= (q + pe 


which implies that x is a binomi. 
required condition is 


10.1.6. Cumulative Functi 


By def. cumulative f” is g: 


Ko(t) = logs 


= nlog 


= nlog 


at 


But Ko(t) = kyt- 


k,(O) = ap,k 
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ie nee anne 


-and greater than k—1) from 


s than k) is the mode. 
mial distribution coincides with 


1, np is the greatest integer less 


v that 


‘= (q+ pe)” 


M(t) 


4 
t 
+3n(n—Np?g? + 


= npq (q- Pp) 
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and Ha = npg {p’ —pq+q? +3(n-))pq) 


= npq {1+3(n~2) pq}. 
Ex. 10-4. Js the sum of two independent binomial variates a binomial variate ? If not, 
what are the conditions under which it is so ? 


Sol. Let x, and x, be two independent binomial variates with parameters n,, p, and 


Ny, P2 respectively. 


Then M(t) of x; = (q, +pye'y" 
and M(t) of XxX, = (q2 + pre’) 
Then M(t) of x = {Mo(t) of x,}.{Mo(t) of x2} 


= (q, + pie)" «(qa + pre’) 
which being not of the form (g + pe’)” implies that x is not a binomial variate. 
If p, = p2 = pso that gq, =4q. =q 
Then Mj(t) of x = (q + pe')"*™ 


which implies that x is a binomial variate with parameters (n, +.) and p. Therefore, the 
required condition is 


P| = P2- 
10.1.6. Cumulative Function and Cumulants 


By def. cumulative f” is given by 


Ko(t) = log M(t) =n log (q+ pe’) 


2 3 4 
logyq+p ie 4 ee 
r ya ay | aa 


It 


2 3 4 
= nlog tip p sp oa pe 
. 210 Bh AL 
2 3 4 1 2 3 2 
= t+p—+p—+p—t...p-—4 ptt p—+p—t 
= 4p Par PS Pa 7 p Px Px 
3 
ere ae —— {pt+...J4+ 
3 P Px ” Ptt.. po tees 
a ra 
But K,(t) = Me Pla a hi oesee 


k,(0) = np, ky =npq, 
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"ks = n{p—3p? +2p"} = np {l-3p+2p"} 
= np(1-2p)(1— p) = npq (q- P) 
ky = nlp-7p? +12p? -6p*}=np (1-Tp+12p° - 6p} 
= np{(l- p\(l-6p+6p’)} 
= npq{l—6p(- p)} 


= npq {l-6pq}. 
Ex. 10-5. Show that for the binomial dist. with parameters n and p. 


dk, 
Pe aa | dp 


k 
Hence deduce the values of k,k3 and k 4. 
Sol. For B.D., M(t) = (q+ pe')” 


Ko(t) = log Mo(t) =n. log {q + pe'} 


k, = 2 tog.a+ re) 
dt 


t=0 
dk, E e-1 | 
= n|— 
r f 
dp dt’ |q+pe ))\_4 
qi*! 
Also kay = "| yz {log (a+ pe )} 
dt ee 


E | pe’ \ 
ed ree rs 
dt’ \|q+pe wr 
dk d’ | pe’ — pqe' + pq 
Kp Pg SB 
dp dt qt pe an 
E {earn} 
oF) ooh 
dt pe +d Jj). 


7 F | =0 
dt han 


dk, 
dp 
Put . r= 1,2, and3 


key = PD 
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Ex. 10-6. /f a coin is tossed 


probability of exactly - —X head. 


Sol. Let the occurrence of a h 

Then 

.. Probability of x successes 
PC 

Since n is an even number, le 


where k is a positive integer. 


PC 


P(k-a 
ptk) 


Now 


By Stirling’s formula. 
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2p” dk 
oe ky = P9—~ = npq as ky = pj (0) = np 
dp 
~7p+12p* -6p"] dk, 
k, = Pa, Paap) 
lp 
dk, 
and ky = PI ay "PA {q(q- p)~ p(q- p)-2pq} 
2ters n and p. = npq {(q +p) -6pq} 
= npq {1—6pq) 


Ex. 10-6. /f a coin is tossed n times where n is a large even number, show that the 


probability of exactly 5 —X heads and ; +x tails is 


= 2x? 
(q+ pe'} (2. par 
m™ 
’ »| | Sol. Let the occurrence of a head in a toss be called success and p be its probability. 
t=0 
Then S = i =q 
2 
“. Probability of x successes is given by 
'=0 : 1 n 
3 et P(x) = “cy. 3 
ve')} 
s Since 7 is an even number, let 
fe n=2k 


where k is a positive integer. 


1 2k 
1=0 : ie P(x) = *e, (=} 


cal . 1) 
? 2k — 
t=0 P(k - x) caf ) 


Now = 


(k) 2k 
eG 
2 
t=0 


a (2k)! ktk! 
(k= x)'(k +x)! (2k)! 


kik! 
(k= x(k +x)! 
By Stirling’s formula. 


1 
= Jom et 2 
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1 1 
-k kts =f ae 
P(k-x) V2ne*k. 2 V2ne*.k 


P(k) 


pe eer ke ae 
V2ne***(k-x)  2.y2ne**(k+x) ? 


Akh 


per) yee, 
(k-x) 2 (k+x) 2 


pg = {e-s+3) log(1-7 {k +x43) log (142) 
P(k) 2 k y) k 


i] 
a 
= 
i] 

& 
+ 
J 
Nea 
SN 
as 
+ 
Nl 
| 
nN 
+ 
) a 
oN 
= 
+ 
& 

+ 
Nir 
Nene” 
fon oN 
als 
| 
Nile 

eb 
iS) N 
NS” 


1 
neglecting terms containing a and higher powers of ip as k is large. 


2 
x 


Pk—x) = Plkje * 


R 


Seo = 

rea oa al 
2k+— 

J2ne 7 (2k) 


1 
met | 


NI pl 
TT 
Ne 
a” 
Ww 
~~ 
as) 
1 
|® 
aly 
nN 
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Ex. 10-7. Six dice are throv 
dice to show a 5 or 6 ? 

Sol. Here N = 729, n =6 

Let the occurrence of 5 or 6 

Now p = prob. of occurrenc 


. Prob. of x successes is g: 


] 
By theorem of total probability, 


.. No. of times at least thr: 


Ex. 10-8. A perfect cubic dic 
of 5 or 6 is called a success. In 
Sol. Here 


«. Probability of x success 


P(x 
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ee 


1 
kt yp kts 
2 J2ne*.k ? 


1 
X+— pe ktxt a 
2. J2ne**(k+x)  ? 


as k is large. 


= 2x? 
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Ex. 10-7. Six dice are thrown 729 times. How many times do you expect at least three 
dice to show a 5 or 6 ? 
Sol. Here N = 729, n =6 
Let the occurrence of 5 or 6 be regarded as success and p be the probability of success. 
Now p = prob. of occurrence of 5 or 6 


age 
6 3 
gaps. 
. Prob. of x successes is given by 
6 6-x 
J 2 
By theorem of total probability, probability of at least three successes 
= P(3)+P(4) +P(5) + P(6) 
| 
| = = (805.23 +9 c4.27 +6 c5.24° c6} 
= ey cbs 8+ 562 4462+ 1 
729 | 3.2.1 43.2.1 | 


= | 16046041241} 
729 


233 
729 
~. No. of times at least three successes occur 


= ae 729 = 233. 
729 


Ex. 10-8. A perfect cubic die is thrown a large number of times in sets of 8. The occurrence 
of 5 or 6 is called a success. In what proportion of the sets would you expect 3 successes ? 


| 
Sol. Here n=8 
p = probability of success 
= probability of occurrence of 5 or 6 
aoe 
6 3 
2 
= |-p=l--==-— 
q P 3 


a TV yee 
PO) EB gf ga 


— $¢5.2° 876 _ 32 
P(x = 3) = 38 "32 81x81 


. Probability of x successes is given by 
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_ 12. 0-2731. 
6561 
.. Required proportion = 0:2731 = 27-31%. 
Ex. 10-9. Assuming that half the population are consumers of rice so that the chance of 


1 
an individual being a consumer is > and assuming that 100 investigators each take 10 


individuals to see whether they are consumers, how many investigators would you expect to 
report that three people or less were consumers ? 


1 
Sol. Here N = 100, n = 10, aa 


.. Required number = 100 {P(3)+ P(2)+ P(1) + P(O)} 


100, 
7 pio test me, + We, + a 
7 100 | 10.9.8 4109 041 
1024 | 3.2.1 2.1 
= ei {120 + 45+ 10+ 1} 
1024 
1024 


Ex. 10-10. An irregular six-faced die is thrown and the expectation that in 10 throws it 
will give five even numbers is twice the expectation that it will give four even numbers. How 
many times in 10,000 sets of 10 throws would you expect it to give no even number ? 

Sol. Let p be the probability of an even number and q = 1-p. 

Then the prob. of getting five even numbers 


= 10 ox pg? 
and the prob. of getting four even numbers 
=o pigs 
By given, 
Oo. pg? = 2%, p*q® 
or 3p = 5q=5-5p 


5 3 
>and g =— 
Cage eg 


. 10 
3 
10 
”. Required number = 10900 a2 | 


order to destroy it completely. 
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Ex. 10-11. /n a precision bom 
strike the target. Two direct hits « 
bombs must be dropped to give a § 


Sol. Here 


Let n be the fected number 
PC 
where P( 
1- P( 
0-012 P( 

or 


The value of n is the least po 
Putting 


which is not true. 
Putting 


which is true. 


Ex. 10-12. Show that if two 1 


_n (and of the same number of o. 


coincides with the (r +1)th term. 


terms is a symmetrical binomial « 
Sol. Let 'V be the number of 
distribution are 


v{7) Nc f 
2 ‘ 
. rth term of the first distrit 


and (r +1)th term of the second 
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s of rice so that the chance of 


0 investigators each take 10 


stigators would you expect to 


+ Co} 
10 vf 
}} 


yectation that in 10 throws it 
rive four even numbers. How 
give no even number ? 


—p. 
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10 
ee 10000{ 2} 
8 


= 0-549 x1. 
Ex. 10-11. Jn a precision bombing attack there is a 50% chance that any one bomb will 
strike the target. Two direct hits are required to destroy the target completely. How many 
bombs must be dropped to give a 99% chance or better of completely destroying the target ? 


1 
Sol. Here p= Fi =q. 


Let n be the required number of bombs. Out of n at least 2 bombs must hit the target in 
order to destroy it completely. 


P(2) + P(3) +...4 P(n) 20-99 
say (LY 
where P(x) = ¢x a]? 


1-P(0) — P(1)>0-99 


n 


I c 
0-012 PO) + PU)=—- +52 
or 100 (n+1) <2” 
The value of n is the least positive integer satisfying the inequality. 
Putting n=10 | 


1100<2'° or 1100<1024 
which is not true. 
Putting n=11 

1200<2!! or 1200 < 2048 
which is true. 
; n=11. 


1 
Ex. 10-12. Show that if two symmetrical binomial distributions [> =O 3] of degree 


_n (and of the same number of observations) are so superposed that rth term of the one 


coincides with the (r +1)th term of the other, the distribution formed by adding superposed 


terms is a symmetrical binomial distribution of degree (n +1). 
Sol. Let NV be the number of observations. Then the successive terms of the binomial 


distribution are 
n n n nv 
1 1 
N. = tNAC; zs settee N Cp) H | eeeeeeee N "e,| = 
2 2 2 2 


1 
.. rth term of the first distribution = N." c,-) an 


1 


and (r+1)th term of the second distribution = N."c, an 
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& — ee t ps = ss — 
N 
. Sum = get orn oe. } 
1 Ex. 10.14. Each of two perso 
= N. da — obtain the same number of heads 
n i 
2 Sol. For each person, 
n+l 1 ! 
2N. Cc, eer | 


which is the (r +1) th term of the binomial distribution 


n+l 
2N. a e 
2 2 


_which is symmetrical binomial distribution of degree (n+1) and total frequency 2N. 

Ex. 10-13. Eight mice are selected at random and they are divided into two groups of 4 P( 

each. Each mouse in group A is given a dose of certain poison ‘a’ which is expected to kill 
one in four; each mouse in group B is given a dose of certain poison ‘b’ which is expected to 
kill one in two. Find the probability that the deaths in groups B are lesser than in group A. 


. Prob. of x heads is given t 


Since two persons toss inde 


Sol. For group A, 


n=4,p = 


.. Prob. of x successes is given by 


| 
> 
cad 
a 
AIR 
cad 
———~ 
pl w 
Nee” 
. 
Rad 


P(x) = 
fee ay 
256 
For group B, 
ee eee 
n »P 2” q ) 


. Prob. of x successes is given by 
4 1 x l 4-x 
Ax) ="&l5] [5 


16 
Reqd. prob. = Q(0) {P(1) + P(2) + P(3) + P(4)} + QU) {P(2) 
+P(3) + P(4)}+Q(2) {P(3) + P(4)}+ Q(3)P(4) 


L444. 43, 4 2,4 4 
=—— 3° + 75.3% + "C3.34+ °C 
4096» a 2 3 4) 


4 4. (4 
+4. 9.3” ae BAe a ek 3.34 4c4)+ c3.(c4)} 


i: 
= —— ((108+544+124+1 + 4 (544+124+1)+6112+1)+4} 
7006 (34 )+ 6( 


Reqd. prob. = P(3)P(3) + Pl 


i} 
—_<£=__—x 
a 
N|e 
Niacin 


1 
—~ {I 
64 

Ex. 10-15. Find the probabi 
failures to the probability of exac 


Sol. Let p be the prob. of su 


Then prob. of r failures = "¢ 


and prob. of (n—r) failures "c,, 


.. Ratio 


This ratio can be independe 


or 


Ex. 10-16. Bring out the fi 
binomial distribution is 5 and it: 
Sol. Here np = 5 


and vi 
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+1) and total frequency 2N. 


ey are divided into two groups of 4 
1oison ‘a’ which is expected to kill 
ain poison ‘b’ which is expected to 
oups B are lesser than in group A. 


A) {P(2) 
(3)P(4) 


3 

Pe nee Bo eer tg 4e4) 
4 4. 74 

$+ “eg) + “c3.(e4)} 


+1)+6(12 +1) +4} 
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ed 
~ 4096" 
Ex. 10.14. Each of two persons tosses three fair coins. What is the probability that they 
obtain the same number of heads ? 


Sol. For each person, 


n=3 
. : 1 
p = prob. of head in a toss of a coin = 7 
1 
=l-p=— 


. Prob. of x heads is given by 


l 3 
P(x) = up ae = ‘(5 


Since two persons toss independently, 
Reqd. prob. = P(3)P(3) + P(2)P(2) + P(L)PC) + P(0)P(O) 


Doyo egg hace 
64 16 


Ex. 10-15. Find the probability of success, if the ratio of the probability of exactly r 
failures to the probability of exactly (n-r) failures in n trials is independent of n. 


Sol. Let p be the prob. of success and q =1—p 
Then prob. of r failures ="c,q" p"” 


and prob. of (n—r) failures "c,_,g"" p" 


n-2r 
.. Ratio = (2) 
q 


p 
een 
q 
or p=q=l|-p 
=o 
p~ 5" 


Ex. 10-16. Bring out the fallacy, if any, in the following statement. The mean of a 
binomial distribution is 5 and its s.d. is 3. 
Sol. Here np = 5 


and vnpq =3 


npq = 9 
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TT 
or 5q=9 

q=18 


which is not true as probability is to be less than unity. 


Ex. 10-17. /fon an average | vessel in every 10 is wrecked, find the probability that out 


of 5 vessels expected to arrive 4 at least will arrive safely. 
Sol. Let the arrival of a vessel safely be called success, 


Then Pp = prob. ofa vessel to arrive safely 
= (1-prob. of a vessel to be wrecked) 
a ee eee 
10 10 
cs 
7” 10 
Here, n=5 
5 x 
c,9 
P = 
(x) 103 
Required prob. = P(4) + P(5) 
= — (e494 +99) 
ie 


4 4 
me CRE), che ») = oe =0-91854. 
10 10 
Ex. 10-18. ‘m’ things are distributed among ‘a’ men and “b’ women, show that the 
chance that the number of things received by men is odd, is 
1 (b+a)" -(b-a)” 
2° (b+a)” : 
Sol. If one thing is distributed, prob. of a man to get 
a 
a+b 


and prob. for a woman to get it 


b 
a+b 
. Out of m things distributed, prob. for men to receive ‘r’ things 


ma a r b iF 
= "¢,{ — 
at+b a+b 


.. Prob. for men to receive odd number of things 


Wola) ag 


where summation extends over odd values of r from 0 to m 


- {("c,ab™ +" c, wpa cya b+.) 
(at ” 
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7 1 (b+a)” -(b- 
(a+b)” 2 
Ex. 10-19. Mean of a bino 


1-92. Find other constants of th 
Sol. Here 


qq 

or q(2q 
or 2q? -q-0 
or (2q +0-6) (q-C 


. From (2) 


Mode = greatest integer les 
=4 
Variance = 1, = npq = 20( 


s.d.= ¥ 


Ex. 10-20. The following da 
filter paper for 80 sets of seeds. | 
x: 0 1 2 3 
y: 6 20 28 12 
Sol. Here n = 10, N= 80 an 
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ked, find the probability that out 


arrive safely 
to be wrecked) 


-=0-91854. 


and “b’ women, show that the 


‘r’ things 
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on 1 (oe a)” —(b-a)™ 
~ (atb)” 2 


Ex. 10-19. Mean of a binomial distribution is 4 and its third moment about mean is 
1-92. Find other constants of the distribution. 


Sol. Here np=4 ..(2) 
M3 = npg (q-p) =1-92 ee) 
q(q~-p) = 0-48 
or q(2q-1) =0-48 
or 2q7 -q-0-48 =0 
or (2g +0-6)(q-0-8) =0 
q=0-°8 as q#-0-3 
P=0-2 
4 
an Zen (i) n= ae} =20 
n= 20, p=0-2,q=0°8 


Mode = greatest integer less than (1 +1) p= 4-2 
=4 
Variance = 11, = npq = 20(0-2) (0-8) =3-2 
s.d. = ig = 3-2 
Hg = npg {l+3pq(n-2)} 
= 3-2 {1+ 3(0-2)(0-8) 18} 
= (3-2)(9-64) = 30-848 


2 2 
92 
p, - b= SP < ons 
p32 (3-2) 
Mg _ 30-848 5 1s 


Pa = 2.2)? 


11 = ¥B, = 03354 
Y¥, = B,-3=0-0125. 
Ex. 10-20. The following data are the number of seeds germinating out of 10 on damp 
filter paper for 80 sets of seeds. Fit a binomial distribution to these data : 
x: 0 1 2 3 4 5 6 7 8 9 10 = Total 


y: 6 20 28 12 8 6 0 0 0 0 0 80 
Sol. Here n = 10, N= 80 and Zf= 80° 


= ye: a (20)1 + (28)2 + 12(3) + (8)4 + (6)5 


ys 80 


A.M. 
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80 
_ 14 
80 
174 
"= “39 
1-74 


EI 5175 
p~ 8 


gq = 1-p=0-7825 
Hence the binomial distribution to be fitted to the data is 
80 (0-7825 +0-2175)"° 


. Required binomial frequency distribution is 
x: 0 1 2 3 4 5 6 7 8 9 10 
y: 7 19 24 18 8 3 1 0 0 0 0 
Ex. 10-21. For a binomial variate x, find p ifn = 4 

and P(x = 4) = 6P (x = 2). 
Sol. The distribution of x is 


PG) = *ezp*q*? 


Now P(4) = 6P(2) 
p* = 6 4c, pg? 
on p> = 36q? (assuming p #0) 
= p = 6q ("pg $0) 
ei = 6-6p 
ay 
P 7 


Ex. 10-22. For a binomial distribution the mean is 4 and variance is 2. Find the 
distribution. 


Sol. np = 4, npg = 2 
1 
I~ > 
1 1 
= 1--=- 
P ae 
n=8 


.. Binomial distribution is 


1¥ 
P(x) =8c, (3) wx = O12. 8. 


1 
AA104S SIS p=. 
P=) 
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Ex. 10-23. [fx andy are binom. 


Find P (x+y 21) 
Sol. Since probabilities of succ 


Let 
Then distribution of z is 


P(z 


P(z2l 


Ex. 10-24, Starting with the id 


n 


ya 


x=0 


find mean and variance of B.D 
Sol. By given identity 


x=0 
Differentiating w.r.t. p 


n 


oe (org? —(n-x)q" 


x=0 


x=0 
n 
= DOr a amd: 
x=0 
n 
ie., dix"c.p*q" : 
: x=0 
ie, x —np(q + p)" =( 


np : 


UY 
tad 
f 
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ee 


~ 
oo 
Ke) 
= 


(assuming p # 0) 
(- p,q $0) 


4 and variance is 2. Find the 
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; 1 1 
Ex. 10-23. [fx andy are binomial variates with n=10, p=— and n=5, p= respectively. 
2 rg 4 


Find P (x+y 21) 


Sol. Since probabilities of success for x and y are same, x+y is a binomial variate with 


- 
n=10+5=15, p=-—. 
a: 


Let 


Then distribution of z is 


P(z) 


P(z21) = 1-P(z<)) 
= 1-P(z=0) 


Ex. 10-24. Starting with the identit 


n 


dex Pa" = (q+ py" 


x=0 


Jind mean and variance of B.D. 
Sol. By given identity 


n 


yen Pg" = (G+ py" 


x=0 


Differentiating w.r.t. p 


n 


Dex xp? 'g?* -(n- qh * tp") = (q+ py 14D) 


x=0 


n 


Diep tig" {xq—-(n-x)p} = 0 
x=0 


n 
> "cxP*q"*(x- np) =0 
=0 


a n 
Dix"e.p"q"* —np Y"c,p*q"* =0 
x=0 x=0 


¥—np(q+p)" =0 
X=np- 


ll 
a 
iv) 
Nn 
Fear 
Nl 
Nae, 
an 
wn 
nN 
| 
Oo 
i 
a 
oi] 


(1) 
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Differentiating (1) w.rt. p 
Ye, [apt tg" —(n— x) p*q"*") (x—np) + p*q"*(-n)] = 0 


x=0 


n n 


ie, d"CxP™ gq" *" (xq -(n— x)p}(x—np) -ny\"c,p*q”* =0 


x=0 x=0 
Lyne ptgh 
ae —)) c,p*q"* (x—np)’ -n(q+ p)" =0 


=> H2 ="pq. 

Ex. 10-25. Two dice are thrown n times. Let x denotes the number of throws in which 
the number on first die exceeds the number on the second die. Find the distribution of x. 

Sol. Let in a throw success means : 

“no. on first die exceeds the no. on second die”. 

’ Different possibilities of success are : 
(2, 1), (3, 1), (3, 2), (4, 1), (4, 2), 4, 3) 
(5, 1), (5, 2), (5, 3), (5, 4), (6, 1), (6, 2), (6, 3) (6, 4), (6, 5) 


als AS 
= prob. of success in one throw = 36. 12- 


Ss 
. x is a B,V with no. of trials = and prob. of success = DD 


5 
~ b ? 
x (m 5) 


Ex. 10-26. /fx is binomially distributed with parameters n and p and y is beta distributed 
with parameters k and n—k +1, then 


1 we! n-k 
PB =p ray (—u)""* du 


and hence F,(p) =1-F,(k 1). {where F,(.) denotes the c.d. f of x etc.} 
Sol. Let P= P(x2k) 
= 1—-P(x<k) 


k-1 


-1-))"e;p igh 


j=0 


k= 
ae ange "clip tq” 1 —(n-j)p? Fae 
dp j=l 


: _ ng Si") o4 a-j -(n- 7) ad c; piqrt | 


jel 
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se er Oar dc he 


el 
Sf ja 
lOta- =p 


ae ~ n! 
=nq a ree 
UG -Dia-j 


=n Cy Pg 
Integrate w.r.t. p 


[P]’p=0 =n. Neg [ior 


=n, Pays fi 
(n-1)! 
(k-1)!(n-k)! 


___ Pmt) 
[(k) T(n-k +1) 


P(x2k) =n 


eS ee 2 
Pa 
. 1- P(x <k)= F,(p) 
ie, 1- F,(k-1) = F,(p) 


Ex. 10-27. A drunk performs a 


He starts at zero. He takes succ 
p and to the left with probability 1 


after n steps. Find the distribution . 


Sol. Let x; denote the ith step 
Then x 


Define a variate y;5.t. y; = 7 
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i 


q” *(-n)|=0 


c,p*q” * =0 


es the number of throws in which 
i die. Find the distribution of x. 


, (6, 5) 


ess = D2 ‘ 


ers nand p and y is beta distributed 


‘1-u)"* du 


the c.d. f of xetc.} 


~(n- jypi ght] 


-j -(n-j) "Cc, p! a Jn : 
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ee? 


e 
jn! pg ues n! j ot 
yf ape OD 4 


j=l 


" k-l 
~| n! pi n-j n! jn-j-l 

=ng" ~ ) \———__—~ p!" q" I ~-——_—_ pq 

St. I)! (n- pi jM(n-j-Y! 

kal . . . . 
=nq"! {n"! Cj-1 pi! qui =" Ang, pigh i} 

j=l 

k-1 


= ng" -nl""eg gq’! oP laps pig ] 
=n" cy, pt gt 
Integrate w.r.t. p 


_ P _ - 
[P]? p-0 = n.” ra } p* ‘(1 —p)" dp. 
. iy Qo > 


=n" "e. ‘ic (l-u)"* du 
oO 


(n-1)! 
wis 1)!(n—k)! 


oa : wk! n-k 
P(A) I (n- nea i =u)" du 


: Pix . b 7 i k- ye we du 


1 
~ BK, n-k+l) 


. 1-P(x<k)= F,(p) 


[rw a du)" du. 


ie, 1-F.(k-1)= F,(p) 


Ex. 10-27. A drunk performs a ‘random walk’ over positions 0, +1, + 2,...... as follows : 


He starts at zero. He takes successive one-unit steps, going to the right with probability 
p and to the left with probability 1-p. His steps are independent. Let x denote his position 


+ 
after n steps. Find the distribution of = 2 - and then find E(x). 


Sol. Let x; denote the ith step of the drunk. 


Then x, =1 if drunk goes to right 
= -] if drunk goes to left 210412 3 
. x; +1 
Define a variate y;,S.t. y; = “2 
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Then y,; =1 if x; =1 
=0 if x, =-1 
Then P(y; =1)=p and P(y; =0)=1-p=q. 
. D(x, +1) Ix,tn_ xtn 


Let Z= y= 7) ,) 2 


i=l 


Then z is a B.V with parameters n, p. 


E(x)+n 

Also E(z) = =a te 
E(x) = 2np-n . 
= n{2p-]}. 


Ex. 10-28. [fx has a b(n,p) and y follows negative uionomial distribution with 


parameters r and p. 


Show that 
F,(r-1)=1- F,(n-r) 

Sol. RHS. = 1-F,(n-r) 

= 1-P(Y<n-r) 

= P(Y>n-r) 

Yen-r+l 
Put y=(n-rt+l)t+t 
bas » nite gh Tilt pF 


t=0 


= p woe ae q' 
t=0 
wo {| r-l 
2: A ane > ci: C15 q' 


t=0 [k=0 


r-l] oo 
a p" aes "oy ps ara “| 
k=0 


t=0 
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Take t-(7-1-k)=j 


r-1 
n 


= ¢ 
k=0 
Ex. 10-29. Let x,;x, ben 
P < Pz, Show that | 
P(x, s 
Sol. Let x be a B.V. with pari 


| 
M- 
3 
an 


P= p(x Sk) 
x=0 
= rie +" 
+, 
IP = n-1 1 n-1 
i = —-ng"™ + e{q —| 


1G {kp* qr * -(n —kp* 
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ee —————————— TEL 


Take t-(r—-1-k)=j 


2 2 
r-l 00 ; : 
- re uae Ros > r-l-k+j Opp gitr-k 
k= j=0 
r-l > : ; 
2 p'q" "cp q' aes qi 
k=0 j=0 
e vionomial distribution with r-l 
= p’q" Mee q' {1 4rnk C; qa tl Zs q? } 
k=0 
r-| 
= pq’), Oe {14 -bygs EDD 2UGs)) q° anf 
oy 
k=0 
r-1 7 
_ pq"), a qe (l -qy" 
k=0 
rd ' r-1 
- > "co, p* q"* =F,(r-1). 
k=0 
d 
Ex. 10-29. Let x,;x. be two B.Vs with parameters n,, p\;nz, pz respectively. If 
p’ P; < Pz, Show that 
P(x; Sk) = P(x_ Sk), (k =0,1,.....n). 
Sol. Let x be a B.V. with parameters n, p 
ag k 
P=p(x<k) = es pq 
x=0 
cK cass} ‘ = gh + cy pq™h +" cy pigh tat Cy gn 
+e p* qr 
r-1 ? ; 
nt ‘ 4 
yee erat = caf B= ang’ +", {q"! (nD) pq"? } +" 09 {2pq" - p*(n-2)4""} 
k=0 dp 
* Boke cya{(k-1) pt? ge ~(n-k+ lp "q"*} 
» pene rt 
‘0 +", {kp a’? -(n -k) p* qh} 
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= —(n-k)" Cr p< q™ —k-1 


<0 
.. P decreases as p increases 
=>. P(x, $k) = P(x, 2k). 
EXERCISES 
1. Ifxisa b(n, p) {i.e., binomial variate with parameters n, p}, then show that 


2 
and B{2- p| _ PUl-p) 
: n 


n 
2. If x is a random variable distributed according to the binomial law P(x = k)=b(k) = 
"¢,p*g"*,k =0,1,...05.. .q=1-p, show that 


b(k+1)_n-k. p 
bk) k+l ke 
3. If x is binomially distributed with parameters n and p, what is the distribution of 
y=n-x. 


4 
2 14 
4. The m. gf. of a random variable x is & 3° é| - Show that 


5 1 x 2 9-x 
Piu-20<x<u+20)= 9) %e,(4] @ 
x=1 


2 
5. A’s chance of winning a game against B is 3 Find his chance of winning at least 


Ans. bid 
243 


6. The incidence of occupational disease in an industry is such that the workers have a 
20% chance of suffering from it. What is the probability that out of six workers 4 or 
more will catch the disease. 

Ans. ES 
3125 


{ 
7. If 5 coins are tossed, what is the probability that there shall be at least 4 heads ? 


mi 


8. Out of 4000 fatnilies with 4 children each, how many would you expect to have at 


three games out of five. 


; 1 
least 1 boy ? Assume that the probability of a male birth is 5 [Ans. 3750] 
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9. 


10. 


11, 


12. 


13. 


14, 


15. 


16. 


17. 


Find the probability of gue 
examination. 


If we take 100 sets of 10 to: 
to get 7 heads and 3 tails ? 
In the above example in ho 


An ordinary six-sided die i 
4, 3, 2, 0 aces ? 


An experiment succeeds tw 
trials there will be at least ¢ 


A teacher claims that he co 
whether they will obtain I 
demonstrate his claim he fc 
being correct in 4 cases, 


In litters of 4 mice the nur. 
noted. The figures are give: 
No. of female mice 0 
No. of litters 9 
If the chance of obtaining < 
constant of unknown proba 

[Ans. 0-466, exp 


Ten coins are tossed 1024 
these frequencies with the 
the data. 

No. of heads (x) 0 
Frequencies (f) 3 
No. of heads (x) 7 
Frequencies (f) 128 


[ Ans. 


Out of 800 families with 4 c 
(i) 2 boys and 2 girls. 
(ii) at least one boy. 

(iii) no girl, 

(iv) at most 2 girls ? 
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5 


i‘ p}, then show that 


inomial law P(x = k)=b(k) = 


p, what is the distribution of 


w that 


his chance of winning at least 


is such that the workers have a 
lity that out of six workers 4 or 


Ans. ccal 
3125 


2 shall be at least 4 heads ? 


Ans. ch 
[amis 


ny would you expect to have at 


ds, 
irth is 5 


[Ans. 3750] 
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9. 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


17. 


Find the probability of guessing correctly at least 6 of the 10 answers on a true-false 
examination. 


If we take 100 sets of 10 tosses of a perfect coin, in how many cases should we expect 
to get 7 heads and 3 tails ? ‘ [Ans. 12] 
In the above example in how many cases should we expect to get 7 heads at least? 

[Ans. 17] 
An ordinary six-sided die is thrown 4 times. What are the probabilities of obtaining 
4, 3, 2, 0 aces ? 


625 25 5 1 
Ans. —— ,— . > >» = 
1296 216 324 1296 


An experiment succeeds twice as often as it fails. Find the chance that in the next six 
trials there will be at least 4 successes. 
Ans. a8 
729 


A teacher claims that he could often tell while his students were still in their first year 
whether they will obtain I, II, III divisions or fail in their final examinations. To 
demonstrate his claim he forecasts the fates of 8 students. Find the probability of his 


being correct in 4 cases. 
ren 2835 
32768 


In litters of 4 mice the number of litters which contained 0, 1, 2, 3, 4 females were 
noted. The figures are given in the table below : : , 

No. of female mice 0 1 2 3 4 Total 

No. of litters 9 30 35 24 5 103 


If the chance of obtaining a female in a single trial is assumed constant, estimate this 
constant of unknown probability. Find also expected frequencies. 
[Ans. 0-466, expected frequencies are the respective terms in the binomial 
expansion of 103 (0-534 + 0:466)‘] 
Ten coins are tossed 1024 times and the following frequencies. observed. Compare 
these frequencies with the expected frequencies obtained by fitting binomial dist. to 
the data. 


No. of heads (x) 0 1 2 3 4 5 6 
Frequencies (f) 3 8 39 106 188 257 226 
No. of heads (x) 7 8 9 10 
Frequencies (f) 128 59 7 3 


Ge BE OD ie BOA ST Re Do AO 
" fs 1 10 45 120 210 252 210 120 45 10 1 


Out of 800 families with 4 children each, how many families would you expect to have 
(i) 2 boys and 2 girls. 
(ii) at least one boy. 
(iii) no girl. 
(iv) at most 2 girls ? 
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1 


Assume equal probabilities for boys and girls. 10.2.1. First four momen 
Ans. (1) 300, (i) 750 (iii) 50, (iv) 550] For Poisson distribution th 
18. A room has three lamp sockets. From a collection of 10 light bulbs, of which only 6 
are good, three are selected at random and are put in the sockets. What is the probability 
that there shall be light ? 


10-2. Poisson Distribution (P.D.) 
Poisson Probability Distribution. The poisson probability dist. of the variate x is u 


x 
P(x) =e” 7 ,x=0,1,...0 
x! 


The variate x is called Poisson Variate and m is called the parameter of the distribution. 
Poisson Frequency Distribution. The poisson frequency dist. of the variate x is 


m 
F(x) = Ne™ x =0,1,.. 
x! 


where N is the total cuiieney 
Poisson distribution is regarded as the limiting form of binciial distribution when n 
(no. of trials) approaches ‘eo’ and p (prob. of success) approaches zero such that mp remains 


a finite constant m. . pl. 
For B.D. Pa) = "epg" 
Lt P(x) = Lt "Cpa, 
p->0 a0 
np=m np=m 


ae =o (2) (1-2) 
noo x!(n—x)!\n n 
natal OST 
nro x\(n-x)! n n n | 
2m, mn=l)..(n- DL my (-2)" | 
x! os > n* n n 


m* n 1 
= — Lt |—].J1-—|...... 
x! nooln n Writing x° as x(x-1)(x- 
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1 


i) 300, (ii) 750 (iii) 50, (iv) 550] 
F 10 light bulbs, of which only 6 
e sockets. What is the probability 


ability dist. of the variate x is 


ro) 


the parameter of the distribution. 
mncy dist. of the variate x is 


eo 


of binomial distribution when n 
oaches zero such that mp remains 
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10.2.1. First four moments about mean 
For Poisson distribution the probability of x succesovs 1s given by 


x 
PC) = 6 x = 0,1, Qyeesernso 
x! 


T 
M 
& 
& 
i 
Ms 
& 
3 


(0) 


ao 


15(0) = >) x?P(x) =D {x(x-1) + P(x) 


x=0 x=0 


e™ 


Ms: 


x(x- any + xP) 


x=0 x=0 


= em? (l+m+...)+m 


=e "me" +m=m> +m 


| ‘ buy = 15(0)~ {44(0)}? =m? +m—m? =m 
a 
15(0) = } x9 P(x) 


x=0 


Writing x? as x(x—1)(x-2)+3x(x-1l)+x 


> {x(x — I(x — 2) + 3x(x-1) + x}P(X) 


13(0) = 
x=0 
= m>+3m? 4m oe: 
15 = 15(0)—3p5(0) pf (0) +2{u4 (OF? 
= m>4+3m? +m- 3(m? +m)m+2m> =m 
14, (0) = ox* PC) 
i x=0 
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But x* =x(x—-1)(x—2)(x-3) + 6x(x—1)(x-2)+ Tx(x-I+x 


i, (0) = >) {x(x - (x 2)(x - 3) + 6x(x— D(X 2) 47x(-1) + x}P(X) 
x=0 
= m* +6m> +7m? +m. 
sw, = pi,(O)— 4110) 114 (0) + 625,(0) {1450}? - 3,0)? 
=m +6m3 + 7m? +m—4(m? +3m? +m)m + 6m? +m)m? —3m* 


=3m? +m. 
Ex. 10-30. /n a Poisson distribution, P(x) fox x = 0 is 10%. Find the mean. 
Sol. Let m be the mean 


x 


Then P(x) = eo" 

x! 
P(0) =e” 
e"™=0-1 
1 

m— —_=10 

01 . 
\ m = log, 10 =2-3026. 


Ex. 10-31. Deduce first four moments about mean for Poisson distribution from those 
of binomial distribution. 

Sol. Poisson distribution is the limiting form of binomial distribution when n (no. of 
trials) tends to infinity and p (prob. of success) tends to zero such that np remains a finite 
constant m. 

For binomial distribution, 


py (0) = np, HW. = "pq, H3 = npq(q - P) 


and bg = mpg (1+3pq(n-2)} 
u1(0) for P.D.= Lt 1} (0) for B.D. 
on | 
np=m 


= Lt (np)=m 
no 
p-0 
np=m 


ti, for P.D.= Lt p2 for B.D. 
noo 
p-0 
np=m 
= Lt (npq) 
noo 


np=m 
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}t3 for 


and Uy for 


Ex. 10-32. Let and ys, 
distribution respectively. Obtai 


| Hence deduce the values o 
| Sol. By def., 


x=0 

lc tg 
ees x-k r+] ~2 a 

r dé : x! 
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-l+x H3 for P.D.= Lt i, for B.D. 
n->00 


p0 
np=m 


7x(x—-1)+ x}P(x) 
= ane npq(q - p) 


np=m 


yy m 2m 
= Lt m1-— || 1-—— J= 
im? — 3m‘ Lin ")( =) 7 


and tg for P.D.= Lt py for B.D. 
non 


1%, Find the mean. 
p-0 


np=m 
= Lt npq{l+3pq(n-2)} 
non 


po 
np=m 


Lt n(1-2)} 92 (1-2 na} 
no n n n 

Lt n(1-2){1+an(1-)(1-2}} 
n—00 n n n 


= 3m? +m. 


Ul 


oisson distribution from those 


al distribution when x (no. of 


\ such that np remains a finite Ex. 10-32. Let Xand, denote the mean and central rth moment of a Poisson 


distribution respectively. Obtain the recurrence formula 


d 
Mra =A, + Ee 
oq(q - P) dr 


Hence deduce the values of B, and B,. 
Sol. By def., 


= 4 
L, = > (x-A)'e i 


_ x0 


n 
i oe = a (et (x—ay MB tae ee —ay -r(x—a)y te * 17) 


= ye WIA (x-A—r(x—-Ay te 27} 
=d x! 


a1 3 c-aytet ES e-aytet E 
3 xt eaux 


| 
= ries —Tp,-1 
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dy 
Heat = nae +4 m 
Put r =1,2and3 
d 
H. = App tA =A as Hy =land p, =0 
dn 
dyn _ 
[3 = 2H) + A = 
| diz 2 
= 3Ap, +A—= = 30 +A 
and Hy | Ho ah 
2 
1 a 1 
3 4 
= —=~—andB, =— =3+-. 
Pe ed rr 
Ex. 10-33. For a Poisson variate x with parameter i, show that 
, , ay, 
Brat = Ap; + AT 


where 1). = E(x’ ) and r is a non-negative integer. 


Sol. By def., 
My = E(x’) 
Z Sve x 
. x=0 x! 
id r 
a a ys te* +e att} 
x: 
x=0 
vere yt r+] -A xv 
= = x! 2X a x! 
, 1 f 
=~, Fabra 
1 WB, 
> Mra = AB, + AT” 
10.2.2. Measure of skewness and kurtesis 
By def. 
2 2 
— {4s pm 
17 vB, u3 ae es 
¥; 27 0asm> © . 
2 
ee = 3m tm 3,1 
Ya =Pa-3= u3 m m 


¥2 7 0asm—> o. 
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Ex. 10-34. For Poisson dist 


Sol. LI 


10.2.3. Mode of the Poisson Dis 
In Poisson distribution the p 


P 


The mode is that value of x 
_ P(x+1) ie, 


Consider J 


or a 


or 


Similarly other inequality gi 
From (7) and (ii) modal valu 


Case I. If m is an integer, the 


P(x=m 


Now Pa=m- San 


P(x= 

.. In this case P(x) increase 

to decrease. Hence dist. is bimod 

Case I. If m is not an integer 

m= 

when x takes the value ‘a’ (whicl 

i 

' x=a (greatest integer less | 

10.2.4. Moment Generating Fu 
By def., 


My 
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SS  — 


Jip =land p, =0 
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Ex. 10-34. For Poisson distribution show that 
MOY ;7¥2 =1. 


1 1 
= mim =. Hb 
Sol. L.HS. de 9 
10.2.3. Mode of the Poisson Distribution 
In Poisson distribution the probability of x successes is given by 
x 


P(x) = eo 
x! 


The mode is that value of x for which P(x) is greater than or equal to P(x-1) and 


P(x+l) ie., 
P(x-1) ¢ P(x) 2 P(x+)) 
Consider P(x-1) s P(x) 
cee mr! ak m* 
a epee cal 
or x<m (2) 
Similarly other inequality gives x2m-1 ...(ii) 
From (i) and (ii) modal value x satisfies the inequality 
m-lsxsm . (iv) 


Case I. If m is an integer, then (mm —1) is also an integer. 


P(x=m) om” ~(m-!)! 
oer P(x=m—1) nl em 
eee 
m 
P(x=m) = P(x=m-1) ...(iv) 


. In this case P(x) increases till x = m-—1 and then (iv) holds and after that it begins 


to decrease. Hence dist. is bimodal with modes m—1 and m. 


Case I. If m is not an integer, let 
m=a (an integer) + f(a fraction) 


when x takes the value ‘a’ (which is less than m but greater than m—1) from (i) and (ii) 
P(a-1) < P(a) > P(a +1) 
x =a (greatest integer less than m) is the mode. 
10.2.4. Moment Generating Function 
By def., 


. t 2 t os m 
M(t) = E(e") =} ee er 
x=0 


2 ty\x 

= me ) - f t_ 

= em =e pine = ge 1) 
0 x: 
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Now M;(t) = Efe} =e ™ E(e*) 


em M(t) = eme'-I-t) 
Deduction of Moments 


M, (t) = eme -I-t) 


l 

ee 

+ 

= 
—, 
y(% 

+ 
wpm 
=a be 
epee 
|, 
———, 
|S 
w[ 
N 


2 3 4 


t 
But M;(t) = I+pe+ias ths a that caseees 


Hh, = O,p. =m, p3 =m, HW, =m+3m’. 
Ex. 10-35. Show that the sum of two independent Poisson variates is a Poisson variate. 
Sol. Let x, and x. be two Poisson variates with means m, and m). 


Let Sagas 
Then Mo(t) of x = {Mo(t) of x1}. {Mo(t) of x2} 
moe My (oof x = 
_ M(t) of x2 = em (e'-1) 
M(t) of x = em tm Xe'=1) 


which is a m.g.f. of a Poisson variate with mean m, +m. 
"x is a Poisson variate with mean m, +m. 
Ex. 10-36. Find M,(t) of the difference of two independent Poisson variates with 
means m, and m, and show that it is not a Poisson variate. 
Sol. Let u=x-y 
Then - My(t)of u = E(e™)= Ee’) 


E(e”)E(e”) 


f 


m(e=1) ome -l) 


é 


= exp {m,(e' -1)+m,(e“ -D} 


Since Mo(t) of u is not of the form ene“) yu is not a Poisson variate. 
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10.2.5. Cumulative Function a 
By def. Cumulative functio 
. K 


. k 
Ex. 10-37. [fx is a Poisson 

P(x 
Find mean and variance. A 
Sol. Let 4 be the paramete1 


Then i 


Now P(x 


. Mean = Variance = A = | 
Ex. 10-38. [fx and y are in 


P(x 
and P(y: 
find var ( 


Sol. Let m,,m, be parame 
respectively are 


and (2) => e 


Var 
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Pr eke acs Sn ae 


i) 
-t) 


lg =t+ 3m’. 
in variates is a Poisson variate. 


; m, and my. 


t) of x2} 


Ny. 


pendent Poisson variates with 


, 


(e* -1)} 


a Poisson variate. 
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10.2.5. Cumulative Function and Cumulants 
By def. Cumulative function is given by 
| Ko(t) = log Mo(t) = log e™@-Y 


= me’ -1) 


Pr Pp 
mit+—+—+:; 
2! 3! 


aS k, (0) = m=k, =k; =....... 
Ex. 10-37. [fx is a Poisson variate such that 
P(x =1) = 2P(x = 2) 
Find mean and variance. Also find P(x = 0). 
Sol. Let A be the parameter of x. 


Then P(x) = gah 
x! 
Now P(x=1) = 2P(x=2) 
2 
“Ay = Io 
i eae, 
i= 


. Mean = Variance = A = 1 and P(0) =e". 
Ex. 10-38. /fx and y are independent Poisson variates such that 


P(x=1) = P(x=2): (1) 
and P(y =2) = P(y=3) ...(2) 
find var (x—2y). 


Sol. Let m,,m, be parameters for x, y respectively. Then probability f"s for x and y 


respectively are 


6 Nak iby 
x! 
-m, my 
Ply) = bere 
2 
: Ba, es a ld 
Hy ee 1! ay 
=> m, =2 
-m, M2 yom Mm 
and (2) => ere: i e i 
> m, =3 
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and 


Var (y) 


let u 


Since x and y are independent, 
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m, =3 

x-2y 

x-2y 

(x-X)-2(y-) 

E(u-a)? 

E{(x-%)-2(y-y)}? 

E{(x-%)° +4(y~y)’ -4(x-¥)(y-)} 
E(x- 3)’ +4E(y- 9)? -4E(x-x)(y-9) 


E(x-x)(y-y) = E{x-y—xy-xy+x-y} 
= E(xy)-x-y 
= E(x)E(y)=x-y 
=x y-x-y=0 
Var (u) = Var (x)+4 Var (y). 
Var (u) = 14. 
Ex. 10-39. [fx is a Poisson variate with mean m, find 
() Ee). 
(ii) E(xe™* 
‘ -m mn ~kx 
Sol. (i) E(e*) = ze rik 
-m (me as _ om ome! 
=e 2 a oe 
~m(l-e*) 
= —m —kx 
(ii) E(xe"™) = De head 
a >> m* —kx 
cal (x-}D)! 
omy met" 
7 (x-1)! 


= -k 
e™ (me *)e™ 


ky 
= meme )-k 
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ened eR 2 
Ex. 10-40. Show that in a. 
2 
mean is —. 
e 
Sol. Since in Poisson distri 


Poiss 


.. Mean deviation about rr 
I 


Ex. 10-41. /f x and y are 
Prove that the probability th 


exp. {mt+m't” 


—m-—n'), 

Sol. P(x—y =r) is require 

Now *—y will take the va 
where s =0,1, 2,... 

By compound prob. theoren 
is 


.. By total prob. theorem p1 
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2 _A(x-Xx)(y-y)} 
) -4E(x-x)(y-¥) 


y} 
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Ex. 10-40. Show that in a Poisson distribution with unit mean, mean deviation about 
a2 | 
mean is —. 
e 
Sol. Since in Poisson distribution with parameter m, mean is m. 
: m=t1 
Poisson distribution is 
zl 
P(x) = —,x=0,1,2... 
x! 
*. Mean deviation about mean is given by 
M.D. = E|x-] 


: % oo! 
>a 
a x! 


% a | 
= ée 
e _ 
= ‘ety 1)! -+| 


x=2 


It 
a 
1 

—_—_—__ 

_ 

+ 

=|- 

Ve 


Ex. 10-41. /f x and y are poisson variates with means m and m’ respectively. 
Prove that the probability that (x-y) has the value r is the co-efficient of t" in 
exp. {mt +m't"!—m-=m’'). 

Sol. P(x—y =r) is required. 

Now «—y will take the value r when x takes the value r +s and y takes the value s 
where s = 0,1, 2,... 


By compound prob. theorem, prob. of x taking the value r +s and y taking the value s 


on m'** em m'* 
(rts)! s! 


*. By total prob. theorem prob. of (x — y) taking the value r. 


aa es m’* 
en 


—(m+m’') 


is 


mm! 


é , co-efficient of t” ine 
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pW cad 
= co-efficient of t’ in e’"*"* ~"™™ 
x-m Peer 
7 7 z= ind its 
Ex. 10-42. fx is a Poisson variate with mean m, find M.G.F. of i, and fi 


‘limit when m-> ©. 


Sol. M,(t) of z = E(e”) 


x-m 


E HED = em ph evm 


e'™ My, + of x 
vm 
Tec 
= eid gm(e 1) 


# 
= emer 1-10 


t 
log {My(t) of z} = m(e™™ —1)-tVm 
2 3 
Tare as ie ere) pe SP en Ya 
«moet ae) ae) ~~} 


t 
| 2 ining ~7— and higher powers 
a + terms containing aA and higher p 


1 
Lt log (Mj(t) of z}= 5 
mo 


2 


1 
xt 
Lt My(t)of z=e? . 
mo 
Ex. 10-43. If x is a PV. with parameter m show that 


: 
Pix>r)< yr ‘=0, 1,2,.. 
r: 


Sol. We have 


Poc>r) = POO 


x=r+l 
ao ae m 
= é voter 
x! 

x=rtl 


am mt? | 
a tee Poe awl 
= @ [an (+2)! ” 


THEORETICAL DISTRIBUTION 
rea hae tar 


we have (rt 
=> 
(lI) => 
Now e’ 
> 
“ ()> 


Ex. 10-44, The probability that 
is 0-01. Byapplying Poisson’s appn 
of 100 items selected at random fron 
item is 2 : 
e 
Sol. Here » h 


R m 
.. Prob. of x defective items is 


P(x) 


P(x $1) 


Ex. 10-45. Find the probability 

200 fuses if experience show that 2% 
Sol. Let the presence of a defect 
Then p = prob. of success = 0-0% 
Here n = 200. 
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itm’ t 


7.F of => a je 


hmm’ 


x 


and find its 


...(1) 
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we have (r+)! = (r+i(rt+i-D)...(r+ Dri 


2i(Gi-1)...L,r!=itr! 


1 1 
ee <n 
ad (r+i)! (iter!) 
r+! r+2 
-m|m m 
es ee (a ay | 
cm’ [gm 
=e 7 ees sistas ...(2) 
; mn? 
Now e” = lt+m+—+t4...... 
2! 
m 
> MN +m +....., 
2! 
2 
zy elma ies J. 
r 
uo Q> Plx>r)<=., 
r: 


Ex. 10-44. The probability that an item produced by a certain machine will be defective 
is 0-01. By applying Poisson’ approximations show that the probability that random sample 


‘of 100 items selected at random from the total output will contain no more than one defective 


so igre 
itemis —. 
ée a 
Sol. Here » n= 100, p=0-01 
a m=np=1 
. Prob. of x defective items is given by 
x -1 
P(x) =e™ wee ‘ais as 
x! x! 
P(x} 1) = P(x<1) 
= P(O)+ PC) 
=eltte! 
_2 
e 


Ex. 10-45. Find the probability that at most 5 defective fuses will be found in a box of 
200 fuses if experience show that 2% of such fuses are defective. 

Sol. Let the presence of a defective fuse in the box be called success. 

Then p = prob. of success = 0-02 

Here n = 200. 
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Since n is large and p is small the distribution can be taken to be Poissonian. 
m = np =(200)(0-02)=4 


“™ = e+ =0-0183 


% 
WW 


x x 
os = (0-0183) 4— 
x! x! 


a] 
o-~ 
ta 
—_ 

tl 


P(0)+ PCL) + P(2) + P(3) + P(4) + POS) 


Required Prob. 


= (0-0183) edtgaes yee ee = 0-78. 
3 3 15 


1 
Ex. 10-46. /n a certain factory turning out razor blades, there is a small chance 500 


for any blade to be defective. The blades are supplied in packets of 10. Use Poisson’s 


distribution to calculate the approximate number of packets containing no defective, one 


defective and two defective blades respectively in a consignment of 10,000 packets. 


1 
.H = —,n=10,N =10,000 
Sol. Here P 500 


3 
\ 


} 
= wa abe S0002 
500 50 


eM = ¢? = 0.9802 
. Prob. of having x defective blades is given by 


(0-9802)(0-02)* 
x! 
. No. of packets containing x defective blades 


(0-9802)(0-02)* 
! 


P(x) = 


= 10,000 


. No. of packets containing no. defective blades 
= 10,000 (0-9802) = 9802 
No. of packets containing one defective blades 
= 10,000 (0-9802) (0-02) = 196.04 
w 196 
and No. of packets containing two defective blades 
2 
= 10,000 (0-9802) oo =1-9604 
ww 2. 
Ex. 10-47. Fit Poisson's distribution to the following and calculate theoretical 
frequencies : : 
Death 0 1 2 3 4 
Frequencies 122 60 15 2 1 


. (122)0 + (60)1 + (15)2 +(2)3 +(1)4 
Sol. m = mean = a ee NG 
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—m 


ne He a1 4(0-5) + (- 


=1-0-5+0-125-0-0208 + 
= 0-61 (nearly) 
.. Theoretical frequency of x « 


.. Theoretical frequencies are 

122, 61, 15,2a 

Ex. 10-48. A car-hire firm has 
demands for a car on each day is 
Calculate the proportion of days on 


which some demand is refused (e"' 
Sol. Let x be the number of dei 


Then dist of x is 
P(x) 


Now the proportion of days on 


and the proportion of days on v 


Ex. 10-49. For a Poisson variai 


r c cy Uy} 
Sol. By def. 


Hyst 
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————_——$———— : 


ken to be Poissonian. _ 604304644 _ 


0-5 
4 200 
-m _ 05 1 2,1 ae. 4,1 5 
. @° =e ~ =1+(-0-5)+— (-0-5)* +—(-0-5) +—(-0-5)" +—(-0-5)? 4.00.0... 
4% 2! 3! 4! 5! 
x =1-0-5+0-125-—0-0208 + 0-0026-—0-00026 
_ P(3) + P(4) + PCS) = 0-61 (nearly) 
.. Theoretical frequency of x deaths is 
32 32 | =0-78 x 
Pa ae 5 200.65 0:5)" 
x! 
. —_—_ x 
‘s, there is a small chance 500 = 200.(0-61) (0- > 
1 packets of 10. Use Poisson's ; Me 
ts containing no defective, one .. Theoretical frequencies are 
ament of 10,000 packets. 122, 61, 15, 2 and G. 
Ex. 10-48. A car-hire firm has two cars which it hires out day by day. The number of 
1,000 demands for a car on each day is distributed as a Poisson distribution with mean 1-5. 
: Calculate the proportion of days on which neither car is used and the proportion of days on 
02 which some demand is refused ee = 0-2231). 
we Sol. Let x be the number of demands for a car in a day. 
Then dist of x is 
-1s (1-5)* 
P(x) = e a-sy" 
x! 
Now the proportion of days on which neither car is used 
= P {of no. demand in a day} 
= P (x=0)=e7!* =0-2231 
)-02)" and the proportion of days on which some demand is refused 
= P(x>2) 
= 1-P {x <2} 
9802 
= 1- P(x =0)- P(x =1)~ P{x =2)} 
)-02) = 196.04 = 1-e {141-541-125} 
= 1-(0-2231) {3-625} = 0-19126. 
Ex. 10-49. For a Poisson variate with parameter i, show that 
0-02)" =1-9604 Aly Mpa +" C2 Mpg C,H} = Brat: 
2! Sol. By def. 
= _4yrtl 
wing and calculate theoretical rat ~ E(x—A) 
© x 
3 4 i = Do-ayter* 
2 1 | x=0 
(5)2 +(2)3 + (1)4 . co ax 
a : Binh Ie op 
0 : = )(x-dye = (=A) 


x=0 
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: ae ry ok aS Aye ah 


roca 
- Le I-24 et 


= OCs A+Dfe ae — M4, (changing x to x + 1) 
x=0 


= a 
Er AY + 0c, (x-2)" 1.4", je* a Me 
x=0 


Mp, +” Cy Wp tee C,]— Ap, 


ML cpp tt” C/o] 
(Ho =) 
Ex. 10-50. [fx and y are independent Poisson variates, show that the conditional 
distribution of x given x + y is binomial. . 
Sol. Let 4, 1 be the parameters of x, y respectively. 
Then, z= x+y isa P.V. with parameter 4+. Distributions of x, y, z are 


x 


x 
= eh eee 
a a x! 


aoe 
P(y) =e Sige 


Now P(x =r/z=n) 


P(xarnxty=n) 
P(z=n) 


pi Sad) Daten 
P(z=n) 

_ P@er) Py=n-r) 

7 P(z=n) 


{." x, y are independent} 


‘rt “(n-r)! 
et) (A+p)" 
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ee 


="¢ 
i 
where p= yy. 
which gives the conditional dis 
nand p= : ‘ 
A+ 


Ex. 10-51. A telephone s 
hour. The board. can make a 
distribution to find the probabi: 

Sol. Let x donate the no. o 


m= ave 

_ 604 

60 

P(x) = e" 

.. Reqd. prob. = P(x 
sie 


1. For a Poisson distribution 


2. Ifx is the number of occu 
P(x 

3. IfxisaP.V. s.t. 
P(x 
find (7) m, mean of x(ii) B 


4. Examine, if the following 
distribution : 
Mean = 1-5 cm, Variance = 


H3 =1-Scm*, py =8-25« 


5. Ina certain factory turning 


lens to be defective. The lei 
to calculate the approximat: 
two defective, three defecti 


02 = 0-9802) (See 10-« 
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me 


x 
-2 
ae 


= AM, 


(changing x to x + 1) 
Nel 
lc, jer — 7 Abr 
x! 


WH, 


(ty =D 
riates, show that the conditional 


tributions of x, y, z are 


{." x, y are independent} 


nand p= 
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_ n! f x r mM n-r 
~ rl(n—r)! A+u A+ 


="o pl gq? 
A 
where Be 


which gives the conditional distribution of x given x + y =n. This isa B.D. with parameters 


A+p- 


Ex. 10-51. A telephone switchboard handles 600 calls on an average during a rush 


hour. The board can make a maximum of 20 connections per minute. Use the Poisson 
distribution to find the probability that the board will be overtaxed during any given minute. 


Sol. Let x donate the no. of calls per minute. x is a P.V. with mean m, where 
m = average no. of calls which board can handle in a minute 


oe iG. 
60 
x 
P(x) = 7!) 
x! 
.. Reqd. prob. = P(x>20) =1-P(x< 20). 
20 
-19 (10)* 
ay ig 
me x! 
EXERCISES 


- Fora Poisson distribution with parameter A show that 


P(xt+l)= EAs PG), 
x+l1 


. Ifx is the number of occurrences of the Poisson variate with mean m, show that 


P(x =n)- P(x=n+1)= P(x=n) 


. IfxisaP.V. s.t. 


P(x = 2) = 9P(x = 4)+ 90P(x = 6) 


find (7) m, mean of x(ii) B, the co-efficient of skewness. 


- Examine, if the following are consistent to be the first four moments of a Poisson 


distribution : ( 
Mean = 1-5 cm, Variance = 1-5 cm? 


3 =1-5cm?,p, =8-25cm*. 


1 
. Ina certain factory turning out optical lenses there is a small chance 500 for any one 


lens to be defective. The lenses are supplied in packets of 10. Use Poisson distribution 
to calculate the approximate number of packets containing no defective, one de‘sctive, 
two defective, three defective lenses in a consignment of 20,000 packets. (Given that 


e — 0.9802) (See 10-46). 
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6. Find the mean and standard deviation for the table of deaths of women over 85-year 


10 


old recorded in a three years period : 
No. of deaths 


recorded in a day 0 1 2 3 4 5 6 7 
No. of days 364 376 218 89 33 13 2 i 
Find the expected number of days with one death recorded from the Poisson series 
fitted to the data. [Ans. 1-18, 1-17, 397] 


. Red blood cell deficiency may be determined by examining a specimen of the blood 


under a microscope. Suppose a certain small fixed volume contains on the average 20 
red cells for normal persons. Using Poisson distribution, obtain the probability that a 
specimen from a normal person will contain less than 15 red cells. 


Ans. e~”” . (20y" 
mar x! 


. A large number of observations on a given solution, which contained bacteria, were 


made taking samples of 1 c.c. each and noting down the number of bacteria present in 
each sample. Assuming the Poisson distribution and given that 10% samples contained 
no bacteria, find the average number of bacteria per c.c. [Ans. 2°3026] 


. In 1000 extensive sets of trials for an event of small probability the frequencies ‘f of 


11. 


12. 


the number x of successes are found to be 
Bo 0 1 2 3 4 ae 6 7 
f: 305 365 210 80 28 9 2 1 
Fitting Poisson distribution to the above data, calculate theoretical frequencies. 
[Ans. 301,.361, 217, 87, 26, 6, 1 and 2] 
The following data gives the frequency distribution of the number of men killed by the 
kick of a horse in 10 Prussian Army Corps per army corps per annum over 20 years. 
No. of deaths 0 1 2 3 4 and over Total 
Frequency 109 65 22 3 1 200 
Show that the distribution is roughly Poissonian and calculate the theoretical 


frequencies. (e°®! = 0.5434) [Ans. 109, 66, 20, 4 and 1 (4 and over)] 


A manufacturer of cotter pins knows that 5% of his product is defective. If he sells 
cotter pins in boxes of 100 and guarantees that not more than 10 pins will be defective, 
what is the approximate probability that a box will fail to meet the guaranteed quality ? 


(e~> = 0-006738) 


WO gs 
| Ans. 1-(0 0067) 
x=0 
Letters were received in an office on each of 100 days. Assuming the following data to 
form a random sample from a Poisson distribution, find the expected frequencies, 


correct to the nearest unit. (e* = 0-0183) 


No. of letters 0 1 2 3 4 5 6 7 8 9 10 
Frequency 1 4 15.22 21 20 8 6 2 0 1 


[Ans. 2, 7, 15, 20, 20, 16, 10, 6,3, 1,1] — 
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13. Six coins are tossed 6400 til 
probability of getting six h 


14. An area of 144 square kilo 
appeared constant. To test 


divided into 576 squares o 


squares containing 0, 1, 2, + 


given below: 

No. of flying bombs 

per square 0 
Actual no. of 

square 229 


Calculate the theoretical pc 


10-3. Normal Distribution 


Normal Probability Distril 
mand s.d. o is 


dP =— 
fo} 


The variate x is called norm 
The curve with equation 


is called normal curve. 


If N is the total freq. the cor 


Def : Standard normal distri 


where 


(1) To derive normal distribt 
Normal distribution can be 
when n, the number of trials is v 
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-aths of women over 85-year 


4 5 6 7 
33 13 2 1 
rded from the Poisson series 

[Ans. 1-18, 1-17, 397] 
ning a specimen of the blood 
ne contains on the average 20 
, obtain the probability that a 
5 red cells. 


14 x 
-20 Yr (20) 
[sn e er 
aich contained bacteria, were 
number of bacteria present in 
n that 10% samples contained 


[Ans. 2:3026] 
bability the frequencies ‘f of 


5. 6 7 
9 2 1 
theoretical frequencies. 
361, 217, 87, 26, 6, 1 and 2] 
e number of men killed by the 
rps per annum over 20 years. 
4 and over Total 
1 200 
nd calculate the theoretical 


, 66, 20, 4 and 1 (4 and over)] 


‘oduct is defective. If he sells 
than 10 pins will be defective, 
meet the guaranteed quality ? 


10 5x 
| [an 1 -0.0060 =| 


x=0 7" 


ssuming the following data to 
ind the expected frequencies, 


6 7 8 9 10 
8 6 2 0 1 
', 15, 20, 20, 16, 10, 6, 3, 1, 1] 
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13. Six coins are tossed 6400 times. Using the Poisson distribution, what is the approximate 
probability of getting six heads x times. 


sn git cor 
x! 


14. An area of 144 square kilometres was selected for which the mean density of bombs 
appeared constant. To test the hypothesis that the bombs fell in clusters, the area was 


1 
divided into 576 squares of 4 kilometre each and a count made of the numbers of 


squares containing 0, 1, 2, etc., bombs of which there were 576 altogether. The data is 


given below : 

No. of flying bombs 

per square 0 1 2 3 4 5 and over 
Actua! no. of 

square 229 211 93 35 7 1 


Calculate the theoretical poisson frequencies. 
[Ans. 227, 211, 98, 31, 7 and 2] 


10-3. Normal Distribution 


Normal Probability Distribution. The normal distribution of the variate x with mean 
mand s.d. G is 


dP = ! Bae ih Sees 


The variate x is called normal variate. 
The curve with equation 


is called normal curve. 


If N is the total freq. the corresponding normal curve is 


oon | 
Def : Standard normal distribution function is defined by 


(x) = ee fdde. 


where f(x) = Jon e 2 


(i) To derive normal distribution as a limiting form of binomial distribution. 
Normal distribution can be regarded as the limiting form of the binomial distribution 
when n, the number of trials is very large and neither p nor q is very small. 
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x—np 
Let = (1 
e Zz npq (1) 


where x is a binumial variate with parameters n and p. Since mean and s.d. of x are np and 


pq , the variate z defined by (1) has zero mean and unit variance. As x takes values from 


[np [ng 
0 to n, z takes values from ~ a to on and the jump in the value of z at each stage is 


1 
: Inpq ’ Now as n—> two extreme values of z tend to —co and © respectively and the 


jump at each stage tends to zero. Thus, in the limit we expect the distribution of z to be 
continuous extending from ~°o to co and having zero mean and unit variance. 
In binomial dist. the prob. for the variate x to take value x is given by 


P(x) = "¢xp*q" = ———— p*q 


By Stirling’s formula : 


3, 
R 
tO 
a 
a 
) 
3 
3 
= 
+ 
NN] 


2ne"n 2 ne 


np € 
1 1 
(V2me™*.x 2) (V2ne"** (n-x)" ™2) 

Pre reeves 
1 1 np) 2{ nq 2 
oe) 


Ai : - ar ii nxt 
i" eles ee 
x n-X 


log N = — 


P(x) 


R 


Ne 
N 
a 
= 
3] 


I 
—— 
& 

+ 

| 
ay 
ey 
ge 
I 
— 
= 
| 
a 
+ 
| 
aire 
iy 
ga 
= 
Q 
ad 


From (1) x =np+z./npq 


—|np+zJn a log | l+z ae 
1 lp 
~| nqg—zynpq +— jlog| 1-z,;— 
2 nq 


q P : 
As n is very large and tends to infinity both q, ie and aan can be taken to be less 


than unity and hence both the logarithms can be expanded in series. 
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1 3 
= a + terms cor 


. 


.. log N-> -> z” as 


2 


1 
Le, N»e2 asN>a 
‘1 

Since npg is the in 
its limit by dz. 


“. If dP denotes the : 


1 
zt rhs we have 


This is the required cc 
(ii) To derive normal ¢ 
Normal distribution ca 
when its parameter m is lai 


Let 


where x is a Poisson variat 


variate z defined by (1) ha 


- takes values from _ Jj, to 


m—> oo two extreme valu 
zero. Thus in the limit we « 
to o and having zero mea 
x is given by 
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(1) 
mean and s.d. of x are np and 


iiance. As x takes values from 


the value of z at each stage is 


oo and oo respectively and the 


rect the distribution of z to be 
and unit variance. 
:x is given by 


xX NX 


1P 


log [i + 2] 
np 
1 p 
-zynpg +— | log} 1-z,j— 
2 nq 
Pp - 
<,!—— can be taken to be less 
\ nq 


n series. 


| x is given by 
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1 [4 129 
= —|nptz += + = 7° +... 
log N G npq |: 7 7% es | 
a{ng~2 moa +4) rd ese 
2 nq 2 nq 


1 ; : 
= “5 + terms containing 7 in the denominator 


a log N > -= 2” asn—>o 


(A\ Yen |FOHI- as fr |649 
ie., wesee” as N > 00. gents Methe--lécéo- tes 


oma 
Since Inpq is the increment in z at each stage and tends to zero as n —» © we denote 


its limit by dz. 


1 
.. If dP denotes the probability for the variate z to lie in the interval z 5% and 


1 
z+5 de we have 


This is the required continuous distribution of z and is called Normal distribution. 

(ii) To derive normal distribution as limiting form of Poisson distribution. 

| Normal distribution can also be regarded as the limiting form of the Poisson distribution 
| when its parameter m is large. ° 


| x—-m 


Let Z= ta . (1) 


where x is a Poisson variate with parameter m. Since mean and s.d. of x are m and ym, the 


variate z defined by (1) has zero mean and unit variance. As x takes values from 0 to ©, z 


- takes values from _./j, to oo and the jump in the value of z at each stage is ae Now as 


m-—> oo two extreme values of z tend to —o and oo and the jump at each stage tends to 
zero. Thus in the limit we expect the distribution of z to be continuous extending from —co 
to oo and having zero mean and unit variance. In Poisson dist. the prob. for x to take value 
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Let N= ra 


log N = x-m- [+ 3) lo = 
’ ane 2 e m 
mm cain -(ms +z¥m + i) log (i +) [from (1)] 


- ln-(msevmis Nes 


1 is : 
ae z” + terms containing m in the denominator 


12 


log N>->2? orN+>e 2 asm 


1 
Since ~~ is the increment in z at each stage and tends to zero as m—> 0 we denote 
ta 8 


its limit by dz. 


1 1 
.. If dP denotes the probability for the variate z to lie in the interval 2 — * dz and z+ 2 dz 


we have 


12 
] iret 
dP = e ce dz. 
V2n 
10.3.1. Mean deviation about mean for a normal variate with mean m and s.d. O. 
Sol. Dist. of a normal variate x is 


dP e (SS) dx,-0W<x<0 


.. Mean deviation from mean 


where Az 
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10.3.2. Moments 


(i) Odd order moments . 
For a normal variate wi 


(ii) Even order moment: 
By def. 


Put 7 =2 


fig= 36" Ll, = 30%. 
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pats bien cae eae ae 


(1 + +) {from (1)} 


m in the denominator 


oO aS m—> oo we denote 


1 1 
val 2~5 de and ity de 


mean m and s.d. 0. 


ox<0 
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10.3.2. Moments 


(i) Odd order moments about mean 
For a normal variate with mean m and s.d.o 


dP = —== 


Hons 


\ (=) 
e 8! dy -w<x<0 
ov2n 
= E(x-m)*"*! 
ra) If x-m ; 
| wif zom 
fex-mtte if o le 
ov2n - 
2n+l 1 
o “sy x-m 
forte 2" dy where y = 
V2n o 


oO 


(ii) Even order moments about mean 


By def. 


Put 1 =2 


*. Hg = 307 LH — 304. 


Hon, = 


67 (2n-1) pon 


385 


(as integrand isanodd f”), 
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Ex. 10-52. Show that 

Mo, =13.5.......(2n—-1) 02". 
Sol. By recurrence formula 

Han = (2n-1)07 poq-r 
Put n=n, n-l,......... , 2, 1 

Hon = (20-1) 07 Hoya 


Hon-2 = (2n—3) o* Hon-4 


H2 = Lo? Ho =1.¢7 C. Hy = 1) 
Multiplying 
Hon = LB. (2n-1Do*, 
Ex. 10.53. Let x be a N(m, 0) {i.e., a normal variate with mean m and s.d. 6}, then. 


d , 
@ lg = mpl, +(0? —m?)pt +03 Se 
do 
where wt! denotes the rth moment about zero. 
i | du 
(ii) Hore2 = "to, ora 
Sol. By def. 
MH, = E(x") 
o  _I(acmy 
= : jee " Mae 
re) 
cs) 1f x- 
due Ale 1 ject) dx 
do o?J2n ° 


oVv2n o 
1f x-m 
H, 1 } r 2. 2 422) 
= —— 4+ ——— J x (x -2xmim' Je dx 
6 ofV2n 
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(ii) By def. 


Nn 


H, 


10.3.3. Measures of Skewness a 
We have 


Since y, =0 dist. is called: 


10.3.4. Moment Generating Fu 


By Def., 


Mo 


where 
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iif 


(Wo = 1) 


n mand s.d. G}, then. 


do 


a 2 
2xm+ ae) dx 


1 di 
7 Bret 
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I 
N 
= 
= 
z 
+ 
fo} 
i) 
| 
3 
=e 
+ 
Qa 


' — 
H+ 


(ii) By def. 


du 1 ar, 26 
—t = -> = fox-m "e 
dso o'v2n * 


o s co 
ae _ Hor Hors2 
o o 
2 3 dus 
Hopeg =O Par TS sae 
10.3.3. Measures of Skewness and Kurtosis 
We have 
H3 = 0,p4 =304 
M3 Ha 
= —; =OandBp, => =3 
B= 43 M3 


11 = ¥B, =Oandy, =B,-3=0 


Since y, =0 dist. is called Normal dist. 


10.3.4. Moment Generating Function 


By Def., 
i (any 
Molt) = Ele"}=——= [e*.e % 9) ax 
olin J 
eo 1 
= I femme?” dy 
V2n Le 
x-m 
where Lien 
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12 2 2 1 
= Paar 6 1 fe ie . | ee . 
fon i where a’s are constants. 
sahieye = 9-96 Now M(t) of 
143 
tm+—t'o 
. M.G.F. about mean m is given by P 
M;(t) = Efe} = e"™ Efe") 
1 2.2 1 2.2 
mt+—U 0" =o" 
=e™e 2 =e? 
Deduction 
1 2.2 
-o~ 
M(t) = €? 
(: 262 1 ag 4 which is the m.g.f. of a normal var 
a 142 2 2 
1+(54 o + TT tet Fe tee . .. 41s a normal variate with n 
i aa Ex. 10-55. Jf, the independent v 
2nt+l 
1 o2" Post 
and: ar | ee the common mean jp, with a comme 
(2n)! 2” nn! 
1 (2n)! an 
= Hon = aye 0 - normally distributed about the san 
(2 = L.erseessernes 3107". 


Sol. Here m, = my =.....=m 
10.3.5. Cumulative Function and Cumulants 


By def., cumulative f” is given by U : ye xi 
u=—) x; =X is a norm: 
1 


Les 7 i=0 
mt+—o 1 2.2 ; 
Ko(t)=log Mo(t) = loge 2 =mt+—t'o : F 
. Y fee. 
t? nn 
But Ko(t) = kyt+k, aye Ex. 10-56. Show that for the N 


Sol. The density curve for the 
where k,,k2,.....+. are various cumulants. ' 


k,(0) =m, ky =07, ks =0, ky =0,........ | 
hus, all cumulants after the second are equal to zero. 


Ex. 10-54. Show that a linear combination of independent normal variates is also a t Put x-n 
normal variate. : 
Sol. Let x,, x,,......x, be independent, normal variates with means m,,my,......Mm, ; 


and s.ds.01,04,.....0,,- \ 
| which is evidently symmetrical abc 
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—_ 


2 


dz 


my 


2n 


1 
to 


lent normal variates is also a 


s with means 1, M,...-..My, : 


ei vimsatni gaan “ie 
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where a’s are constants. 


Let 


Now 


M,(t) of u = 


E{e"ar FAA y srerssssectne +4,Xy,) } 


Ele), Be")... Ee) 


{" (4), X2,....are independent} 


{taym, dca, 0,7} {(tay )m, ha, y 63} 
2 e 2 


(td, ym, +510, 04?) 


which is the m.g.f. of a normal variate with mean La,m, and variance Sia? ai: 


. wis a normal variate with mean La,m, and variance La?o?. 


Ex. 10-55. /f the independent variates x;(i =1,2,......2) are normally distributed about 


the common mean y, with a common variance o’, show that their mean 


2 


. : o 
normally distributed about the same mean \ but with variance —. 


Sol. Here m, = my =..... 


=m, =U 


1 
and a, =a) =..... Oa 


n 


1 
a >, ~i | is also 


i=l 


n 
1 ee : : Dae : 
Ue -) x =X is a normal variate with mean La;m; =—— = and variance = 
n n 


i=0 


Ex. 10-56. Show that for the N.D. mean, mode and median coincide. 
Sol. The density curve for the N.D. is 


Put 


<< 
ll 


y= 


ov2n 


which is evidently symmetrical about the line X =0i.e., x =m. 
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nh x = mis the median. 

Also evidently y decreases continuously as X increases numerically and is maximum 
for X =0. . 

“.X =Oi.e.,x =m is the mode. 

“. Mean = Mode = Median = m. 

Ex. 10-57. Find the points of inflextion of the normal curve. 

Sol. The eq. of the normal curve is 


lf x-m ' 
pee l } (1) 
ov2n 
At the points of inflextion 
a a 
: = 0 and a= #0 
dx dx 


3(x-m) pica) 


d?y 1 A(axy {- zy ; 
x 


and —, = e 
dx} oV2n o «4 
a 
~ Put ae = 
(x- wh 1=0 
o 
or ; x= mto. 
; 1 
j d° es ee ee 
At , x= mio, — >= etl eS} 
dx” av2n o 6 
ot 
=t e*#0 
ot V2n 
.. At the points of inflextion 
x=m to 
dh fi 1 ; e? 
e from = 
and hence from (1) y - Jon 
Ex. 10-58. Give chief features of the normal curve. 
Sol. The eq. of the normal curve is 
1(x-mY 
: Aw) eh} 


I 
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(7) Mean, mode and median 


(ii) Since y becomes zero whe 
negative and positive sides, at infir 
and.positive sides. 


(iii) At the points of inflextior 


Evidently the points of inflext 


(iv) Maximum value of ording 


Ex. 10-59. Deduce the first fo 
Jrom those of (i) the Binomial dist. 


Sol. (7) For B.D. LL. 
Ll. 
Hl, 
Let 


when x is a binomial variate with p 


Then mean of . 

uy for z= E(z—0)° 
113 for z= E(z*) 

uy for z= E(z*) 


Now as 1 > ©,z—> a normal 


Ly for normal variate 


3 for normal variate 


Lt, for normal variate 
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Eo eee ae 


(7) Mean, mode and median of the normal curve coincide. 


wumerically and is nea (ii) Since y becomes zero when x is numerically infinite, curve touches x-axis both on 
negative and positive sides, at infinity i.e., x-axis is asymptote to the curve both on negative 
and positive sides. 


(iii) At the points of inflextion 
ve. i x=mto 
Eyidently the points of inflextion are equidistant from x = m. 


(iv) Maximum value of ordinate is 


. (1) 
ol 
_ ov2n 
Ex. 10-59. Deduce the first four moments about the mean of the normal distribution 
from those of (i) the Binomial dist. (ii) the Poisson distribution. 
I Sol. (‘) For B.D. ll. = pq 
{="} LL, = npq(q-p) 
Lg = npg {1+3(n—-2) pq} 
"i 
x-n 
2 Let L. P 
i npq 
| when x is a binomial variate with parameters n and p. 
E(x -—np) 0 
Th Leyla Ra 
en mean of z (z) \npq 
) it 2 a 
ay tt, for z= E(z—0)* = —— E(x-np)* =1 
, npq 
1 q~ 
ww for z= E(z?) = : E(x-np)’ = 42 
. v"pq 
2 a ee < (npq) 
ee feta} 1 1+3(n—2) 
3 oO n- 
. 4 for z= E(z*) = = E(x-np)* = Pd 
‘ (npq) npq 
=e 2+#0 Now as 1 —> ©, z—> a normal variate 
1 
kt, for normal variate = LDS for z=1 
& GP _ 
8 D ui; for normal variate = at M3 for z= art mee 
: : npq 
ut, for normal variate = ee H4 for z 
le 1-6pq 
= Lt ;——+3/=3. 
. NO npq 


(1) 
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(ii) For P.D. nh, =m 
a 
bg = mt 3m? 


x-m 
Let ZS fey 

nm 
where x is a Poisson variate with parameter m. 


E = 
Then mean of z = —— =0 
An 


1 
— E(x- m)" =] 
nt 


ut, for z= E(z)’ 


l ae | 
for z= E(z°) = E(x-m)y =—= 
shea amet im 


1 1 
4 for z= E(z*) = —y E(e-m)* =— +3 


m U 


As m— 0, Z— a4 normal variate. 


‘ate = Lt for z=1 
11 for normal variate = 1, 2 


13 for normal variate = te H3 for z= oe =0 


| 
ug for normal variate = Lt (3 + ) a5; 
ms m 


Ex. 10-60. For a certain normal distribution the first moment about 10 is 40 and that 
the 4th moment about 50 is 48, what is the A.M. and s.d. of the dist.? 
Sol. Let m and o be A.M. and s.d. 


Now py (10) = 40 
E(x-10) = 40 
or E(x) = 50 
m = 50 
Also py, = 48 
304 = 48- 
i o =2. 
Ex. 10-61. IfX is a normal variate with mean 30 and s.d. 5. Find the probabilities that 
(i) 26< X < 40, (ii) |X — 30] > 5. 


Sol. (i) P(26 < X < 40} = P{26 < X < 30} + P{30< X < 40} 


X-30. 
5 


Put . Z= 
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(vi) 


Ex. 10-62. For a normal disti 
variate such that the probability. 


Sol. Let X be normal variate. 
: Then dist. of X is 


ia 


Put 


0-41 


Ex. 10-63. Prove that, far the 
and the s.d are approximately in 


Sol. Let Q, and Q, be the C 
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moment about 10 is 40 and that 
f the dist.? 


s.d. 5. Find the probabilities that 


< 40} 
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P{-0-8<Z<0}+ PIO<$Z<2} 


P{0<Z<0-8}+ P{0<Z<2} 


0-2881 + 0-4772 = 0-7653. 
(using normal tables) 
(ii) P{|X —30| > 5} =1- P{|X -30| < 5} 


= 1-P{25<x<35} 


1-2P(30< X <35} 


1-2P{0<Z<}} 


1-—2(0-3413) 


0-3174. 
Ex. 10-62. For a normal distribution with mean 2 and variance 9, find the value x of the 
variate such that the probability of the variate lying in the interval (2, x) is 0-4115. 


Sol. Let X be normal variate. 
Then dist. of X is 


1/(X-2)) 
1 3 3 
= e dx 
dP 320 
ie 
_ Ce) 4 
P(2<X <x) as 
Put X-2 _ 
u 3 =Z 
a2 
eo 
1 -zz 
= 2 d 
Te jie 2 
x-2 


“ZL 
O-ANIS J e? dz 
0 


x= 2+4-05=6-05. 
Ex. 10-63. Prove that, far the normal distribution the quartile deviation, mean deviation 
and the s.d are approximately in the ratio 10:12:15. 


Sol. Let Q, and Q, be the Quartiles 
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Then P{x<Q,} = 0-25 
x=Q,; x=m x= Q, 
P{Q, <x <m} = 0-5-0-25 
= 0-25 
m _1(x-m 
fe A Ng 25 
ov2n 2, 
Put m—-xX _ 
u as =y 
m-Q, 
io Oe wae 
| e 2” dy =0-25 
V2n 0 “a 
m=O _ 9.6744 : (1) 
Oo 
Also P(x 2Q3} = 0-25 
P{m< x <Q3} = 0:25 
Q, \f.x-m i 
1 -3(2"} 
—= dx =0-25 
oynE 
Q3-m 
1 Ff -y 
or ae J e? dy=0-25 
20 4 
Q,-m 
SC 0-6744 2) 
From (1) and (2) 
Q3- 2 _ 0.67440226 
2 3 


-. Quartile Deviation 


2 
=-6o 
3 
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Also Mean deviation 


Q.D. 


Ex. 10-64. /ftwo normal i 
of universe A is k times that of 


1 
is ra times that of universe B. 


Sol. Let N be the total fre: 
Then 


Let m, and m, be the 4. ] 
The frequency functions c 


and ] 
Evidently F4(x) is max 


[F,(2 


Similarly [FQ 


[F4(x 
[Fg(x 


[FyQ 


Ex. 10-65. Assume the me 


10-8 (in)’. How many.-soldier 
tall ? (Given that the aréa und 
0-1368 and between x = 0 ana 

Sol. Let x inches be the hr 


Then x is a normal variate 
. Dist. of x is 
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ae 
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4 
Also Mean deviation =—O 


Q.D.:M.D.:S.D.::10:12:15. 


Ex. 10-64. If two normal universes A and B have the same total frequency but the s.d. 
of universe A is k times that of the universe B, showthat maximum frequency of universe A 


1 
is k times that of universe B. 


Sol. Let N be the total frequency and 6,,6, be the s.d. of A and B. 
Then 6, = ko, 


Let m, and m, be the A. Ms. of A and B. 
The frequency functions of A and B are 


Fas) = ue) 


and F,(x) = e 
, o,V2n 


Evidently F(x) is max for x = m,. 


N 

(Ql) FC) inax = o,V2n 
: _N 

ede Similarly [Fe () max = Oo, Jon 


[Fa(x)]max 2 _1 
[Fg (x) max 1 k 


1 ~S 
[F(X Imax = 7 fa Imax. 


Ex. 10-65. Assume the mean heights of soldiers to be 68-22 inches with a variance of 


10-8 (in)’. How many soldiers in a regiment of 1000 would you expect to be over 6 feet 
tall ? (Given that the aréa under the standard normal curve between x = 0 and x = 0-35 is 


--(2) 0: 1368 and between x = 0 and x = 1-15 is 0-3746). 


Sol. Let x inches be the height. 


Then x is a normal variate with mean 68-22 inches and variance 10-8 (in). 
.. Dist. of x is 
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2 . 
1 9 Re) 
P{x > 72} = 0-5- J => —e dx 
x=68.22 VIO-8 V2n 


x— 68-22 


Put z= ~ Vi0-8 


I 
P{x>72} = 0-5- e? dz 
fe) 
= 0-5-(0-3746) (given) 


= 0:1254. 
“. Ina regiment of 1000, the number of soldiers taller than 6 feet. 
= 1000 x 0-1254 = 125-4 


w 125. 
Ex. 10-66. Jf logyg x is normally distributed with mean 4 and variance 4, find the 


probability of 1-202 < x < 83180000. 

(Given logig 1202 = 3-08, log;g 8318 = 3-92). 

(b) logig x is normally distributed with mean 7 and variance 3. log) y is normally 
distributed with mean 3 and unit variance. If the distribution of x and y are independent, 
find the prob. of 
| 1-202. << <83180000. 


Sol. (a) Let y = logig x. 
Dist. of y is 
_1fy-4" 
dP = ] e 1 . ie 
2V2n 


Now P{1-202 < x < 83180000} = P{0-08 < y <7-92} 


2 


7-22 44} 
Je 2025 gy 
008 


1 
~ oon 
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Put 


P{1-202 <x < 83 


(b) Let z, = logig al 


Then z= z — 2, is alse 
.. Dist. of z is 


Now 


Ex. 10-67. If the skulls 
is under 75, between 75 ane 
normal) the mean and s.d. « 


given that if 


é then f(0-20) =0-08 a 
Sol. Let m and o be th: 
Then dist. of x is 
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(given) 


than 6 feet. 
5-4 


xan 4 and variance 4, find the 


rariance 3. \ogi9y is normally 
on of x and y are independent, 


dy 
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Put 


P{1-202 < x < 83180000} = Jon 


1-96 1, 
a J e 2 dz=2(0-4750) 


(from normal tables) 
= 0-95, 


(b) Let z, = logig x and 22 = logy y. 


Then z= 2, —2Z, is also a normal variate with mean 7—3 = 4 and variance 3+1 = 4. 


.. Dist. of z is 
z-4)" 
coy 
dP = é dt 
2V2n 
Now Phi 202<2< so 180000) 
y 


P{0-08 < z< 7-92} 
= 0-95 {from (a)}. 
Ex. 10-67. [f the skulls are classified A, B and C according as the length breadth index 
is under 75, between 75 and 80 and over 80, find approximately (assuming that the dist. is 
normal) the mean and s.d. of a series in which A are 58% B are 38% and C are 4% being 
given that if 


then f(0-20) =0-08 and f (1-75) =0- 46. 
Sol. Let m and o be the mean and s.d. respectively and x be the length breadth index. 
Then dist. of x is 
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x=m x = 80 


Now P{x <75}=0-58 which is greater than 0-50 and hence the ordinate x = 75 is on 
the right of x =m. 
From fig., P{m <x < 75} =0-08 


5 { x-m 


al 26 eee 
fey TT 
m 


Put 


.. From given. 


75-m _ 


= 0-20 ...(1) 
Again P(x > 80) = 0-04 
P{m < x < 80} = 0:46 


fT) = 0-46 
(eo) TT 


m 


g 1, 
or J e? 
V2n 
m 


ae 1-75 (from given) ...(2) 


From (1) and (2) 
m= 74-4 (approx.) 
o =3-2 (approx.) 

Ex. 10-68. One thousand candidates in an examination were grouped into three classes 
LH, lll in descending order of merit. The numbers in the first two classes were 50 and 350 
respectively. The highest and lowest marks in class II were 60 and 50 respectively. Assuming 
the distribution to be normal, prove that the average mark is 48-2 approximately and standard 
deviation 7-1 approximately. Given that : 
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x 
Go 


0. 
0°: 
0-4 
where the area A is meas 


Sol. Number of cand 


=1000 - 

Also 
P{n 

Put 
P{o <z 


from given data 


ip 
Value of A for — =( 
fo} 

x 
Value of A for. x = ( 


.. Increment in A fo 


x 
.. Increment in — | 
oO 
x 
“. Value of - (for. 


Also 
Pir; 


plo<z 
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0.04 


em 


80 


ce the ordinate x =75 is on 


..-(1) 
given) ...(2) 
Tox.) 

TOX.) 


re grouped into three classes 
‘wo classes were 50 and 350 
1d 50 respectively. Assuming 
‘approximately and standard 
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x x 
_ A =: 
o o 
0-2 0-079 1:5 
0:3 0-118 1-6 
0-4 0-155 1:7 


A 


0-433 
0-445 
0-455 


where the area A is measured from the mean zero to any ordinate x. 


Sol. Number of candidates getting III class. 
=1000-(350+50) = 600 
— Pix< 50} = 0-6 
Also — P{x <m}) =0°5 
P{m <x <50} =0:1 


Put z= 


from given data 


Value of A for <= 0-2 is 0/079 
Value of A for ~=0-3 is 0-118 


x 
.. Increment in A for increment 0-1 in — 
o 
= 0-039. 
. : ‘ 
.. Increment in 3 for increment 0-021 in A 


_ 0-1 


= 0-021 =0-054 


0-039 


:. Value of = (for A = 0-1) = 0-2 + 0-054 


= 0-254 
ASA BOLT 
Also P(x > 60} = 0:05 


P{m <x <60}-= 0-5-0-05=0-45 


po Zz <a = 0:45 
oO 
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(1) 
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As above, from given data 


60- 
= = 1-65 id) 
From (1) and (2) 
o =7:1 (approx.) 
and m = 48-2 (approx.) 


Ex. 10-69. Jn a normal dist. 31% of the items are under 45 and 8% are over 64. Find 
the mean and s.d. of the distribution. 


Sol. Let m be the mean and o the s.d. Then 
P{x < 45} = 0°31 
P{45<x<m} =0:19 


or po" <2 <a} = 019 
o 
0.31 0.08 
x= 45 x=m x = 64 
plo<z< 45} = 0:19 
a= = 0-496 (1) 


64-m 
Similarly plo aor = 0-42 


- 1-405 ..(2) 
From (1) and (2) 
m = 10-(approx.) 
o = 50 (approx.) ; 
Ex. 10-70. In a distribution exactly normal, 7% of the items are under 35 and 89% are 
under 63. What are the mean and s.d. of the dist. ? 


Sol. P{x < 35} = 0-07 
P{35<x<m)} = 0-43 


plo<z< a = 0:43 
Oo 


A a 
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0.0 


From (1) and (2) 


Ex. 10-71. Five thous 
a maximum of 100 marks. 
39-5 and s.d. 12:5. Dete? 
class for which a minimu 
The proportion A of 


x 
at the deviation is 


SQ |x 


Sol. P{39-5< x < 66 


.. No, of students g 
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2) 


id 8% are over 64. Find 


0.08 


(1) 


...(2) 


‘e under 35 and 89% are 
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—— "= = 1-476 vai 
5 1:47 (1) 


P{x < 63} = 0:89 
P{m <x <63} = 0-39 


plo<z < Bal = 0:39 


oars S226, ...(2) 


From (1) and (2) 
o = 10:36 (approx.) 
m = 50-29 (approx.) 
Ex. 10-71. Five thousand candidates appeared in a certain examination paper carrying 
a maximum of 100 marks. It was found that the marks were normally distributed with mean 
39-5 and s.d. 12:5. Determine approximately the number of students who secured a first 
class for which a minimum of 60 marks is necessary you may use the table given below : 
The proportion A of the whole area of the normal curve lying to the left of the ordinate 


at the deviation = is 
o 
1:5 1:6 1:7 1:8 


0:93319 0:94520 0:95543 0:95407 
Sol. P{39-5 < x < 60} 


= |x 


P(0 <z<1-64} = 0-94929 -0-5 
0-44929 
P{x > 60} = 0-5-0-44929 

= 0-05071 


I 


x = 39.5 x= 60 
-. No, of students getting first class = 5000 x 0-05071 = 253. 
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Ex, 10-72. A minimum height is to be prescribed for eligibility to government services 
such that 60% of the youngmen will have a fair chance of coming upto that standard. The 
heights of youngmen are normally distributed with mean 60-6" and s.d. 2-55”. Determine 
the minimum specification. 


From table if f(t) = 


oa) 2 : : 
Fe { exp [- 4 dt, then f (-0-2533) = ol 
TT 


Sol. Let h be the minimum height prescribed. 


ee (22) 
2. a 
Th Je dx =0-6 
x-m 
Put ; t= 
o 
1 7-0? 
—— Je? dt=0-6 
Qn h-m 
o 
BEM 39.9533 
Here m = 60:6, o =2-55 
h= 59-95= 60. 


Ex. 10-73. The local authorities in a certain city installed 2,000 electric lamps in streets. 
If the lamps have an average life of 1,000 burning hours with a s.d., of 200 hours. 


(a) What number of lamps might be expected to fail in first 700 burning hours ? 


(b) After what period of burning hours would you expect that 10% of the lamps would 
have failed ? 
Assume that lives of the lamps are normally distributed. 


ar 


1 f-> 
Given that if F(t) = Von Je dt.. | 
Then F(1-50) = 0-933 
and F(1-28) = 0-900. 


Sol. Let x hours be the life of a lamp. - 
(a) Since normal curve is symmetrical about x = 1000 


P{x < 700} = P{x>1300} 
1~ P{x < 1300} 


| 


) 


1300 (2 
oe ee Je? . 
ov2n © 
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Here 


Put an 


P{x 


.. Number of lamps ext 


(>) Let x =x, bes.t. 
P{ 
P{ 


or 


: Or 


Let 
P| 
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to government services 
upto that standard. The 
ds.d. 2:55”. Determine 


2533) = ot 


electric lamps in streets. 
, of 200 hours. 


0 burning hours ? 


0% of the lamps would 
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1000, o = 200 


Here m 


x= 700 x = 1000 x = 1300 


Put Z= = 


o 200 
1-5 1), 
P{x < 700} = 1- fe 2° dz 
2m 
= 1-0-933 =0-067 


.. Number of lamps expected to fail in first 700 hours of burning 
= 2000 x 0-067 = 134 
(db) Let x =x, bes.t. 
P{x > x,} =0-1 
P{x<x,} =0-9 


or 
x)—m 
1 F -2 
- or — J e* dz=0-9 
2n 
x;-m 
= 1-28 
x, = 1000+ 200 (1-28) = 1256 
~ Let X= X, bes.t. 


P{x<x,} =0O1 
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Then by symmetry of normal curve about x = 1000, 
' = 1000-256 = 744 


*. After 744 hours of nee 10% lamps are expected to fail. 

Ex. 10-74. The incomes of a group of 10,000 persons were found to be dovmitty 
distributed with mean = Rs. 750 p.m. and s.d. = Rs. 50. Show that of this group about 95% 
had income exceeding Rs. 668 and only 5% had income exceeding Rs. 832. What was the 
lowest income among the richest 100 ? 

Sol. Let x be the i income variate 

Here m=750, o=50 


(i) Ply > 668} = 0-5 + P(668 < x < 750} 
= 0-:5+P{-1-64<z<0} 
= 0-5+ P{(0<z<1-64} 
= 0-5+0-4495 
= 0-9495 


x = 668 x= 750 


.. Percentage of persons having income exceeding Rs. 668 
= 94-95 = 95% 
(it) ; P{x > 832} = 0-5- P(750 < x < 832} 


= 0-:5-P{0<z<1-64} 
= 0-5-0-4495 = 0-0505 
.. Percentage of persons having income exceeding Rs. 832 
= 5% 
(iii) Let x= x, bes.t. 
P{x>x,} =0-01 
Then x = x, is the lowest income among the richest 100. 


P{750<x<x,) =0-49 


x = 750 X= X, 


Pec 
or plo<z< ' | oa, 
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xy 


Ex. 10-75. In a certain 
obtained were 50% and the s. 
than 60 inurks, supposing the 
normal curve from x = 0 to x 

Sol. Let x denote the ma 

Then x is N (50, 5) 


Now PO 
Put 

eed 

PC 


.. Expected number of : 


Ex. 10-76. [fx and y are 
16 respectively, determine X 


P(2x+ 
Sol. Let u=2x+y,v= 
Then uisal 

mean = 


' 2 2 
and variance = 40," +0, 
=4.9+ 
and v is a N.V. with mean 
v = 4x 
5 
and variance = 160,° + 90, 


=16.9 
Now P(2x 4 


Put 
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fail. 

were found to be normally 
hat of this group about 95% 
ding Rs. 832. What was the 


0.01 
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x, 7 


¥ 


0 
= 2:3267 


“ x, = 866:34. 

Ex. 10-75. In a certain examination 2000 students appeared. The average marks 
obtained were 50% and the s.d. was 5%. How many students do you expect to obtain more 
than 60 inurks, supposing the marks to be distributed normally? (Area under the standard 
normal curve from x = 0 to x = 2, is 0-4772). 

Sol. Let x denote the marks obtained. 


Then x is N (50, 5) 
Now P(x > 60) = 0-5— P{50 < x < 60} 
Put 7 x-50 

u Zz ae 


P(x>60) = 0-:5-P(0<z<2) 
= 0-5-0-4772 = 0-0228 
.. Expected number of students getting more than 60% marks 
= 2000 (0:0228) 
= 45-6= 46. 
Ex. 10-76. /fx and y are independent normal variates with means 6, 7 and variances 9, 
16 respectively, determine d such that 
P(2x+y Sh) = P(4x—-3y 2 4A) 
Sol. Let u=2x+y,v=4x-3y 
Then uisa NV. with 
mean = 2x+y=26+7=19 


. 2 2 
and variance = 40," +0, 


=4.9+16=52 
and v is a N.V. with mean 
v=4x-3y =46-3.7=3 


; ‘ 2 2 
and variance = 160,° +90, 


= 16.9+9.16 = 288. 
Now PQx+yShk) = Plusd) 


u-19 


Put Zh 52 


406 
Then z, is N(O,1) and 


=> 


u= 


P(2x+ySA) = 


Similarly  P(4x—3y 24) 


Put 


Then z, is N(0, 1) and 


P(4x-3y24)) = Pls 24 


oe 


Since z,,z, both are N(0,1), we can replace both of them by z (which is also N (0, 1). 


“. GivenEq. => P fs s 


pjeshe 
¥52 
1-19 4X.-3 

=> —— and 


V52 288 must be on the opposite sides of z = 0 (which is mean of z) and 


are equidistant from it. 


2 = 


Va 


v= 


\. 


19 +52 z, 


P(1I9+-V52 z, <A} 


4-19 
P<z<——— 
{a= 
P(v24h) 
v-3 


¥288 


3+ 288 z, 


40-3 
P\ 22 == 
(== 
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\ 


_An-3 
288 
114V2 +3V13 


7-51. 


~ 6V2+4Vi3 


(QL) 


(2) 
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Ex. 10-77. If x is a N (2,3 


Sol. 


Put 


Ex. 10-78. Jf N(r) = P(x: 
Sol. Since normal curve is 
PO<sx 


Now 
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3 
Ex. 10-77. [fx is a N (2, 3), find P | ¥ 2 2 where y= xX-1. 


3 
Sol. Now Ay 2 3] 


(1 
a p[x-123] 
2 


= P(x 22-5) 
0-5-P(2$x<2-5) 


Put ~ 2=2 
u Zz 3 


3 
Ay23| = 0-5-P(0<z<0-17) 


..(2 
(2) = 0-5--0675 
= 0-4325 

2m by z (which is also N (0, 1). 

z= 0 (which is mean of z) and x=2 X= 2.5 

: Ex. 10-78. [f N(r) = P(x <r), where x is a N (0, 1), show that N(-r) =1— N(r). 
Sol. Since normal curve is symmetrical about x = 0, 
POO<xsr) = P(-rsx<0) .(1) 


x=0 


Now N(-r) = P(x S-r) 
= 0-5~-P(-r<x<0) 
= 0:5-P(Osx<r) 
= 1-{0-5+P(0<x<r) 


408 . MATHEMATICAL STATISTICS 


= 1-P(x<r) 
= 1-N(r). 
Ex. 10-79. Let x be normally distributed with mean (> 0) and s.d.o. Suppose ¢? is 


some function of say o? =h(u). Choose h(.) so that P(x <0) does not depend on p. 


x-u 


; Sol. Take z= 


P(x <0) = r[zs-4] 
oO 


for this to be independent of Ht, take 
o =H. 
Then P(x <0) = P(zs-1)=®(-1). 


Ex. 10-80. /fx is normally distributed with mean 1 > 0 and variance 6” =”, evaluate’ 


P(x<-p/x<p). 
Sol. P{(x<-pw) N(x <p)} 


x=y 


P(x <p) P{x<—-p/x<p} 
P(x <p) = 0-5P(x <-p/x <p) 
{2 (x <-W (x <p) 
=(x<-)t 
P(x<—p/x<p) = 2 P(x<-p) 


Put Z= Pact 
S H 
= 2P(z<-2) 
= 2P(z>2)=2{0-5— POS z<2)} 
= 2{0-5-0-4772} 


_ = 0-0456 
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1. Ifx is normally distributed 


. Show that the s.d. of a norm: 


deviation about the mean. 


. Ifx is normally distributed w 


of the standard normal cumu 


. xis anormal variate with mea 


(i) Values of the probability 
x = 0, 46-6275, 50,53: 37: 
(ii) Probabilities over the in 
(—00, 46-6275), (46-6275, 5 
Comment on the various res 


[ive that ae = 0-3989 


V2n 


. x is anormal variate with p1 


f(x) = 0-3989 exp {-0-00£ 


Express f(x) in the standar 
of x. 


. x is a norma! variate with p: 


f(x) = 0-7978 exp. {-2x? 4 


Express f(x) in standard fc 
the distribution of x. 


. Assuming a Normal distrib 


(i) the number of observa 


(ii) the value of the variat: 
[Hint : for (ii) ifr be the ve 


Pi 
=> P( 
Put Zz 
=> P 
8 
zi bide 
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4(> 0) and s.d.o. Suppose «2 is 


P(x $0) does not depend on . 


). 


| and variance o* = 1’, evaluate 


u/x<p} 


-P(0<z<2)} 
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4. 


EXERCISES 
If x is normally distributed with mean 2 and variance 2, find P{|x-1| <2}. 


[Ans. 0:7624] 
Show that the s.d. of a normal distribution is approximately 20% more than the mean 
deviation about the mean. 


Ifx is normally distributed with mean 2 and variance 2, express P{|x— 1| <2} in terms 
of the standard normal cumulative distribution function. 

x is anormal variate with mean 50 and variance 25. Set out in tabular form the following : 
(i) Values of the probability density function for 

x = -00, 46-6275, 50,53-3725 and + ©. 

(ii) Probabilities over the intervals 

(-20, 46-6275), (46-6275, 50), (50, 53-3725) and (53-3725, + 0). 

Comment on the various results that you obtain. 


1 1 —~+(0.6745)? 
Given that —— = 0-3989, 2 


-0.185 


. x is a normal variate with probability density function 


f(x) = 0-3989 exp {-0-005x? +0-5x-12-5} 


Express f(x) in the standard form and hence or otherwise find the mean and variance 
of x. 


. x is a normal variate with probability density function 


f(x) = 0-7978 exp. {-2x” + 4x—2} 


Express f(x) in standard form and hence or otherwise find the mean and variance of 
the distribution of x. 


. Assuming a Normal distribution with N = 1000; » = 80; o =15, find ° 


(i) the number of observations expected to lie between 65 and 110. 
(ii) the value of the variate beyond which 100% of the items lie. 
[Hint : for (ii) if r be the value, then 


P(x2r)=1 
=> P(r < x $80}=0-5 
Put 80-x 
u = 
15 
an ploses “}-0 5 
80-r 
= 3-92 
ae 15 
> r=21-2 


[Ans. 818] 
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a 


9 


10. 


11 


12 


13. 


14. 


15. 


Express as an integral the probability that a normal variate with mean 5 and s.d. 2 
would be observed between 2 and 3. 

The following table gives frequencies of occurrence of a variable x between certain 
limits : 


Variable x Frequency 
Less than 40 j 30 
40 or more but less than 50 33 
50 and more 37 


The distribution is exactly normal. Find the distribution and also obtain the frequencies 
between x = 50 and x = 60. 

- [Ans. 11-68, 46-125, 25] 
In a certain examination the percentages of passes and distinction were 45 and 10 
respectively. Estimate the average marks obtained by the candidates, the minimum 
pass and distinction marks being 40 and 75 respectively. (Assume the distribution of 
marks to the normal). 

[Ans. 36-1] 

The marks obtained in a certain paper are found to be normally distributed. If 12:-5% 
of the candidates obtain 60% or more marks, 39% obtain less than 30 marks, find the 
mean number of marks obtained by the candidates. Given 


= 0:27 0-28 0:29 1:14 1-15 1-6 
A 0-6064 06102 0-6141 0°8727 0-8749 0-8770 


. [Ans. 36] 
The height measurements of 600 adult males are arranged in ascending order and it is 
observed that the 180th and 450th measurements are 64-2” and 67:8” respectively. 
Assuming that the sample of heights is drawn from a normal population, estimate the 
mean and the s.d. of the population. 
Steel rods are manufactured to be 3 inches in diameter but they are acceptable if they 
are inside the limits 2:99 inches and 3-01 inches. It is observed that 5% are rejected 
under size. Assuming that the diameters are normally distributed, find the s.d. of the 
distribution. Hence calculate what proportion of rejects would be if the permissible 
limits were widened to 2-985 inches and 3-015 inches. 

[Ans. 1:36%] 
In an examination it is laid down that a student passes if he secures 30% or more 
marks. He is placed in the first, second or third division according as he secures 60% 
or more marks, between 45% and 60% and marks between 30% and 45% respectively. 
He gets a distinction in case he secures 80% or more marks. It is noticed from the 
results that 10% of the students failed in the examination, whereas 5% of them obtained 
distinction. Calculate the percentage of students placed in the second division. (Assume 
marks to be distributed normally). 

[Ans. 34%] 


If x is a normal variate with mean 50 and s.d. 10, find P(y < 3137) where y= x? +1. 
[Hint. P(y < 3137) = P(x? +1<3137) 
= P(x? < 3136) 
= P(|x]S 56) 
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eaten ae ESS 


DS ge 504 


Put z= zai 
P(y< 

= P{- 

= P{- 

= Pf- 

=0.! 

=0-’ 

10-4. Discrete Uniform Distrit 
This distribution is of the fi 


‘: 


where n is a positive integer. x i 


Mean and Variance 


M.G.F. 
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variate with mean 5 and s.d. 2 


of a variable x between certain 


1and also obtain the frequencies 


[Ans. 11-68, 46-125, 25] 
nd distinction were 45 and 10 
7 the candidates, the minimum 
‘ly. (Assume the distribution of 


[Ans. 36-1] 
normally distributed. If 12-5% 
ain less than 30 marks, find the 
ven 


1-15 1-6 
17 08749 0-870 
[Ans. 36] 


xed in ascending order and it is 
64-2” and 67-8” respectively. 
ormal population, estimate the 


but they are acceptable if they 
observed that 5% are rejected 
listributed, find the s.d. of the 
ts would be if the permissible 


[Ans. 1:36%]. 
:s if he secures 30% or more 
1 according as he secures 60% 
en 30% and 45% respectively. 
marks. It is noticed from the 
, whereas 5% of them obtained 
ithe second division. (Assume 


[Ans. 34%] 
(y $3137) where y = x? +1, 
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Put z= = 0, y= 504102 


Ply $3137) = P(-56 < x < 56) 
= P{-56 < 50 +10z < 56} 
= P{-10-6 <z<0-6} 

= P{-10-6<z<0}+P{0<z<0-6) 
=0-5+0-2258 

=0-7258. 


10-4. Discrete Uniform Distribution 
This distribution is of the form 


1 
Se SOT we n 
P(x) nel r 


where n is a positive integer. x is called discrete uniform random variable. 


Mean and Variance 


- n(n+1)(2n+1) 
. 6(n +1) 
_ n(2n+)) 
6 

Ha = 13(0)-X° 


_ mQn+ty ne 


6 4 
n(n +2) 
12 
M.G.F. 
M,(t) = E(e”) 


= — {l+e' +e%+..46%} 
n+ 


\ 
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1 1- efrtbt 
% n+l 1-e' : 
10-5. Geometric Distribution 

' The prob. dist. 


P(x) = aD, x= 0, 1, Dyin = 1-p 
is called geometric distribution. 


x= E(x) = > xq" p = p{q+2q +3q>+...} 


x=0 


= pq{l+2q+3q°+......} = pq(l—q)? 


E(x”) = > xq" p =p DY tx(x-i) +.x)q" 


x=0 x=0 


> x(x-1)q* + p Ya" 


x=0 x=0 


5 (0) 


= p{2-1q? +3-2q3 +4-3q*+.....} +4 


P 
4-3 5-4 
= 2q7p414+3q+——q? +—— 9°... 4-2 
“4 Ge tad - 


- 29° pay? +5 


i 
S 


2) + 
Ha = ny0)-7 544-4 (922) 
P p Pp 


Ex. 10-81. Find M.G.F of the geometric dist. 


tl 
pu 
S 
g 
— 
iT} 
GS 
= 
Sg 
= 
3 


Sol. M4(t) 


I 
M 
Ss 
W. 
“sh 
a) 
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Ex. 10-82. [ftwo indepen 
distribution, show that the cor 


Sol. Let x,, x2 both follo 


Now P(x, =r/x, +x; 


which is discrete uniform dist. 
Ex. 10-83. A population i 
affects a proportion p of the pc 
find the proportion of dying di 
Sol. Prob. of an attack aff 

“. Prob. of an attack not i 

The individuals dying dur 
(r—1) times in the first (n-1 
Now prob. of having dise 


where 
. Prob. of dying during i 


which also gives the proportic 


10-6. Negative Binomial dist 
In the last question, prop: 
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Ex. 10-82. [f two independent random variables x, and x, have the same geometric 
distribution, show that the conditional distribution of x, given x, +x, =n is uniform. 
Sol. Let x,,x both follow the same geometric distribution 
P(x) = pq*,x=0,1, 2.... 


P(x, =13 xX, +X. =n) 
P(x, +x2 =n) 


Now P(x, =r/x, +x, =n) 


3 
+3q'+...} — P(x =r; x2 =n-1) 
n 
} = pq(U-q)? > Pe = 5542 =n=5). 
s=0 
_ P(x =r).P(Q ==) 
as > P(x, =5).P(a, =n-s) 
py {x(x-i)+.x)q" s=0 
i (‘. x1,x2 are independent) 
nxg" _ pq" .pq"” 
20 n 
> P9'.pq"* 
3q*+.....j44 =0 
P 
: 0, 1,2 
= r=0, 1, 2,........ 
Beal ed so veri ; aes 
2! Pp ‘ which is discrete uniform dist. 
Ex. 10-83. A population is subjected to recurring attacks of a disease and each attack 
affects a proportion p of the population. Assuming that r attacks are fatal to the individual, 
find the proportion of dying during the nth exposure. 
Sol. Prob. of an attack affecting an individual = p 
.. Prob. of an attack not affecting an individual = 1 — p. 
The individuals dying during the nth exposure will be those who have had the disease 
Pere (r 1) times in the first (2-1) exposures and catch it again. 
= p (22) = — ’ Now prob. of having disease (r —1) times in the first (n-1) exposures 
= n-ti Oyj p! raid 
where q=i-p 


-. Prob. of dying during nth exposure 


= @ C1 p gq’) (p) ne a a q"" p" 


which also gives the proportion of dying during the nth exposure. 
10-6. Negative Binomial distribution 
In the last question, proportion of individuals dying during the nth exposure. 


n-l ron-r 


= Cr-1P 
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Since death does not commence until the rth exposure, the proportions of death at the 
rth, (r +1)th,...exposure are , 


r(r +1) r 
PY. rap" qQ? py. 


which are the successive terms in the binomial expansion, with negative index, of p’(1—q)” 


a 


Steg" p= p-gy" = : 


n=r 
The dist. (changing n to x + r) 
P(x) = "es q*p",x=0,1,2,....... 
is called Negative Binomial distribution. 


R k. P P = 
emark. Put HEA = 7 
2 - Q 
where Q-P=1 
P(x) = ie Qu P* (. alia PE _*tr-l c,) 


= 7 -r-x = x 
= "e,Q (—P) 
=> All-the quantities like mean, variance etc., for negative binomial variate can be 
written from those of binomial variate by replacing 
. nby -r;q by Qand p by —p 
eg., Mean = (—r) (—P) =rP 
Variance = (-r)(—P)Q=rPQ and so on. 
10.6.1. Mean and Variance 


x+r-1 


Xor 
x. Cr-14 P 


Ms 


at 
Oo 


x 


a 3! 
2 i 
(r+ 2)(r +1) ?..} 


= rqp Sosa mI 


rap’ (1-q) 


r-1 qd 
P 


{t 
ay 
n 
» 
+ 
% 
! 
™ 
SQ 
a) 


15(0) 


= > {x(x—1) +x} **"1c,_, q*p" 


x=0 
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al - 
Since — >1, 42 > x. 
P 
10.6.2. Recurrence Formul: 


Let x be a negative binor 


P(x) whtrxel ro q'p*, Xx: 


Also 


Diticventiating wrt. q 


Now 


4(q\_4 
also dq\p} dq 
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he proportions of death at the 


negative index, of p’(1—q)" 


ative binomial variate can be 


and so on. 


r+ 2a +)r aso} 


+2)r+) > 


1 <r 
€r-14 P 
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paso _ q° +3-2 C2 tee +Dr | Pec § 
. ° Ls P 


2 
r(r se 4 


p(r+lrq?(i-q)" 2 +4 = 
Pp 


Pp p 
by = h(0)-¥° 
_ rr+ig? rg r°q? 
p? pp? 


re r 
Tq+py=-t 
P P 


1 a 
Since —>1, 2 > x. 
Pp 


10.6.2. Recurrence Formula for Moments about Mean 


also 


Let x be a negative binomial variate with parameters k and p. Then prob. f” of x is 
P(x) ak+x-1 Cy q*p*, x =0, 1, o:. 


ha. 


Also x 


: 
By = ex} 
! p 


Diticrentiating wrt. q 


du 5 k+x-1 kq ‘ic d(q)\\ xk 
i] Seep c,| rr) x-— ~k—|—|¢q'p 
iq. San P dq\ p 


Now p=1-q 
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a2 al 
ip 
) ~1 
du, _ yet ~A{s—2) apt 
dq ae Pp P , 


q x=0 
oe es x Brad 
9 p q 
du, rk 
=> Hera = atte s af 


10.6.3. Moment Generating Function and Cumulants 


M(t) 


ao 
tx. x+r—1 xor “ 
> e Cr1q P : 


= pr». xtr-1 c,-1(qe’)* = p’(1-qe')” 
x=0 


. Cumulative f” is given by 


Ko(t) = log Mo(t)=r log p—r log (1— qe’) 


= -r log {2-fe'} =-riog {1 -Lee -0} 
P P P 
2, 33> 4 
= —r log ie fre ier ree Sr oe 
Pp 2! 3! 4! 
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and 


Ex. 10-84. Find the lir 
but L=m (a finite consta 
Pp 


Sol. For negative binor 


Let r>0,q—-0 so 


which is the probability fu 
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ewe op Dp 
-1 
k r 
ta) q* = “F(l+q) 
| 2 3 4 
md =! G4 Ne ig eg 
Pp } and ky {8 p? p> p* 
r+l 
2) kale 9X pk | = pe ltagea’h. 


Ex. 10-84. Find the limit of negative binomial distribution when r > 0 and q—->0 
rq : 
but — =m (a finite constant). 


Sol. For negative binomial dist. 
P(x) = Xtr-1 C4 q*p" 
(x+r-1)! , , 


~ (r=Dix! 


_ @tr-Ietr-2)..0 (=) gy 


r*.x! 


A022). Ne) 


rq 
Let r > ©, g > 0 so that ries Then p> 1. 


Xm 
me 

P(x) > —— 
x! 


which is the probability function of Poisson dist. 
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10.7. Hypergeometric Distribution : 
Suppose an urn contains Np white and Ng blue balls (p+q =1) and r balls are to be 


drawn one at a time without replacement. Let P(x) be the prob. that out of r balls drawn x 
are white. Then 


- P(x) 


Ml 
2 


rg (NDI (Na) 
Cente Nia 


where x = x(x-1)......(x-r+1) 


Np Nq 
ps Mea » Mey! 
t=0 


Consider (1+ y)*?(1+ yy" 


s=0 12(0; 
N 

and aty% = } Mey" | 
r=Q 


Since (1+ y)*? (1+ y)%4 =(1+y)%, equating co-efficients of y’. 


7 
N pe Np Nq 
Cc, = DS Cy Cr-x 


x=0 


»y P(x)=1 
x=0 


.. P(x) can be taken to be a probability density function. The distribution Bx A G8, Fee Oren 


deduce the values of moments 
(x) (r—x) 
r_ &Np)(NQ) 


P(x) = ,x=0,1,2,...r 
aaa no Sol. A 
is called Hypergeometric Distribution. 
10.7.1. Mean and Variance of Hypergeometric Distribution 
r Np 
z= De x. ot Crex 
x=0 c, 
r Np Nq r_ Np-t Nq 
Coe Ce: Cy Cr 
aa es x =i =Np), i r-x 
x=! c, x=l c, 
ie - Let F(a, B; 
— YP_ Np+Nq-! C= ~ N-1 C,1 ='p ; 
C; Cc, 
Then A 


YEP) oF a(x ~1)P(x) +o2P@) 


x=0 x=0 x=0 


anal 0 
: 2 ) F(a, B,y, y) satisfies the 
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q=1) and r balls are to be 


, that out of 7 balls drawn x 


The distribution 


= 0,1, 2,...7 


P(x) +) xP(x) 


x=0 
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r Np, Na 
no 
= y xx-)— +p 
x=0 cy 


(NPND=D) > p-a 
cme ee a 
r 


Cry +'P 
x=2 


(Np)(Np -1) Np+Nq~-2,, 
—— ae 
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= +7, 
No. 2+'p 
Np)(Np ~1) y- 
enit PX p-')n 25 4 trp 
c %~ 
N-2 
C,. r(r-1) 
£(0)—rp = (Np).(Np-1). r=2. — (Np)(Np - 1) ———~- 
Hs(0)~ mp = (Np). = (NEN DE 
rp(Np -1)(r -1) 
50) = ——————— + 
H2(0) N-1 'p 
: = Np-1)(r -1) 
2 ae 415(0)- x? _ PL Pp Yr 2 + p-r?p? 
N-1 
'p 
= Ware Aer tlt Nolen +p) 


"Pq 
= 24 (n-r), 
N-1° r) 


Ex. 10-85. Find differential equation satisfied by Mo(t) of hypergeometric dist. and 


deduce the values of moments about mean. 


E (x) (r-x) 
Ni Ne 
Mo(t) = yet ro, § pp) (Nq) 


Sol. NO 


x=0 


2 ye r, (NDP (Ng) 
SN (Ng-r +) 

= (Nq)” y (Np) .re™ 
NO EA(Ng-r +x) x! 

OB y , (a+) BB+!) y* 


Let F(a,8;y, y) = 1+ a 
«FO yy yl! Wy+1) 2! 


( Nq)” 
NO 


Then M,(t) = F(-r,-Np; Nq—-r +1,e') 


F(a, B,y, y) satisfies the different equation 
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2 


dF 
(l-y) 
yl- ye 


+{y-(a+B+ by -oaBF =0 Equating co-efficients of t, c 
y 


Ul 
iY 


Put y 
Then diff. eq. reduces to 


a’F dF 


(l- ro aera (a+f)e' -l}-aPe' F =0 Hl 


Ex. 10-86. Show that when N 
Sol. For hypergeometric dist. 


{(Np)(Np - 1)... 


“. Mo(¢) satisfies the ee 


ry 2 Mol?) dM,(t) 


(1—e") 2 4 — 2 (Nq -r +1-(-r — Np) e! —1}—rNpe' Mo(t) =0 P(x) ="e, 
dt? dt 
2 
or (1-e') g aM -(r+N p) Mo EO rNpM oC! +n Mo® _ pM, (1) =0 p-4}-(>- 
dt* dt ; N 
= : Cy 
Pe # 
To find mean put M(t) = Dee ope _ ; 
=0 
Then Let N—- 00 
Then P(x)> 


which is the probability function f 
Ex. 10-87. Deduce the moi 
hypergeometric dist. 
Sol. For hypergeometric dist. 


(l-e eth +N DMG rie Np Ds “} 
pres ss a PMS 


Put t=0 
1 
Nyy ~rNp = 0 or py = rp 
Now M;(t)=e"™Mo(t) 
“. M(t) =e M(t) and Ul 
Substituting in the diff. eq. 
2 
“M; dM;(t dM ;(t 
ae] PMO pq) np) AEDs —nparm co |e LEO =o 
| oat? : dt dt 
To find moments about mean put 
i : r? ‘ 
M;(t) 2 Dies = and e ices ak adeas et N- © 
soo Then Ly for] 
i Pa and i; for] 


10.8. Multinomia! Distribution 

Let there be a series of 1 ind 
several outcomes say E), E4,....... 
trial where 


> are ott q)- sD acer 
s s-l 
+(N ae> Hs A, N x Hs ani Di 
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-oPF =0 Equating co-efficients of 1, | ene 
_ tpq(N~-r) 
A N-1 
_ 1pq(q- PN -r(N -2r) 
ape! F =0 B3 (N ~1)(N -2) 


Ex. 10-86. Show that when N + « hypergeometric dist. tends to binomial dist. 
Sol. For hypergeometric dist. 


{(Np)(Np ~1)...(Np = x+ I} {Nq)(Ng ~1)...(Ng—r+x+1)} 


‘—1}—rNpe! My(t) =0 ees NON =1)...0...(N —r +1) 


ye -rNpM(t) =0 


Let N> © 


Then P(x) >"c,p*q'* 
which is the probability function for binomial dist. 


s=0 Ex. 10-87. Deduce the moments (about mean) of binomial dist. from those of 
‘ hypergeometric dist. 
i Sol. For hypergeometric dist. 
A ' Pe 
4s Goa NP Ms oO ; 
rpq| |-— 
_ tpq(N-r) _ ral 4 
f N-1 fae 
N 
an a _ IPUG— pN —r)(N - 2r) 
mp (N -1)(N -2) 
r 2r 
y — p)} 1-— |} 1-— 
ab. a Hoe pag p( a =| 
pt ( 1-2 
N N 
a . 
+t th Pacacke Let N>0 
Then u, for B.D. =rpq 
= po and p3 for B.D.=rpq(q-p). 
> My 
a" “(s-D! 10.8. Multinomial Distribution 
Let there be a series of n independent trials where each trial may result in one of the 
as a ry re several outcomes say Ej, E5,........ E,, with respective probabilities p,, p,....... P, ineach 
» — t+ ; _ : 
~ Be" L. Hs (s-b! trial where | 


Pi + pot.....+py =1 
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Suppose the event E; occurs x, times (i =1,2,.....k) 
Then x, + x24+.....¢%, = 


By the theorem of compound prob., prob. of E, occurring x, times, Ey occurring x, 


times and so on in any fixed definite order = p}' p3?.....p,' 
Now out of » trials x, trials can be had in ee ways and out of remaining (n-— x, ) 


trials x, trials can be had in " “tc, ways and so on. 


. The total number of ways of getting E, — x, times, E,— x, times,...E,—x, times 


_a n-x NAN HX ee XK 
ie aia, a ley 
a n} (n—x,)! (NX, — Xq 0.0 —Xy_y)! 
x, (n—x,)! x "n—x,—x2)! xy Mn — xy... ay)! 
n!} 
= nee _- _ = != 
Xs Xp heonennky! (Ce (n-x,..-x,) = O!=1) 
By total prob. theorem, prob. of getting E, — x, times, E, —x, times,......E, —x, times 
at Xp t2 Xt 
eee = eee Pp; 
Xy!Xg be. xy! 
n n=Xx, (N= Xe Xp-4) n! 
Total prob. = >: > Steet > po}! pz nee pe 
XyIXg he ! 
4, =0x, =0 x, =0 pr"2 
} n 
Consider (p, + pp......+Py)" = {Dy + (Pz t+ Pye +PK)} 
; 
n x, n-Xx 
= > Cy, Pi! (P2 + P3-PR) 
X= 
n n-X, 
= n x, n-x, Xy N-X,-Xz 
= Cy, Pi’ > Cy, Py? (Pate APE) | 
a — 
no MAX 
s n n~-X, xX, x n—-X,-*. 
= Sy » Cy, "Cy, Py Pr’ (p3t-tPy) 1? 
x,=0x,=0 
no A-X, WAX Kysy 
= n n-X N-Xy ee XL xX Xx: x, 
=> > wes > { Cy ey Me, Pipe Pi} 
x, =0x,=0 xpe 
no H-N, NOX we Xp_y t 
= n XX) Xy 
Spin Dias. = Pi! P2’ ++ Pk 
oar ae feo Tg bees " 


THEORETICAL DISTRIBUTION 


Hence the function 


P(X, Xq500-Xy) = 


can be taken to be probability function 
by P(x,,%2....%,) together with the v; 


10.8.1. Moment Generating Functic 
Distribution 


Now Mo(t,, tyegthy) = Ble 


NMRX NX Ay y 


=>¥ ¥.. 
->r-d rar 


= {np;e" (pe + pze”..4 


= np; 


E(x?) = 


=n(n-1) pip; 
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g x, times, E, occurring x, 


nd out of remaining (n— x,) 


> —X_ times,...E, —x, times 


(1 (n-X,..-x,) = O!=) 


—X, times,......E, — x, times 


eskeusds Pi 
n 

x. x, 
47 pit} 
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Hence the function 


P(X, X25..-X,) = XL, !...%,! 
can be taken to be probability function. The dist. formed. 
by P(x,,%2,.--%,) together with the values of x, x2,...x, is called Multinomial Dist. 
10.8.1. Moment Generating Function, Moments, Covariance, etc., for Multinomial 
Distribution 

Now Moltystyesatn) = Bi etitrtotie } 


N~Xy NA-Xy we Xp y 


n! Xx Xg td tt tx, 
sy ——"t ___piipi..pjt errs 
x20 ge0 1X haw. 
” t, \*k 
= k 
ee ela”) (Pee) 
XX, xy RUD eke 
t ty tyn 
= (pe! + Pye? wecsevee +p,e") 


a E(x ) _ ef 
2. E(x;) =4—2 
Ot; 1,70, j=1, 2.4 


= {npje" (pye"' + pre” ...+p,e*}"} 1-0 


= np; 


= {npje" (pye"' +.....+ pet 


+n(n—1) p? e" ‘(piel t......ppe*)"7} 1,=0 
jel, 2,.k 


np; +n(n— Ip? 


Var (x;) = np, +n(n—1)p? -n*p? = np,(1— p;) 


=n(n—-1) pip; 
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-, Cov. (%; x;) = E(x;x;)— E(x; Ej) 


= n(n— lp; Pj —n* pip; = —NP;P; 


Aliter ° . 
“s Bivarl: 
E(x,) = > X%;-—=————— | px? : 
1) 2 a Nats Uovstin mie a Pr 
oD n! 11.1. Discrete Bivariate Distributi 
= Pi ap; > = i ’ a Pi sasweed: Pt : 
i Le he ie k In the case of discrete bivarii 


a P and the pairs of values of X and Y 
Ey ap, 7! + Poet PK) for the pairs. 
i 
Let x,,%5...X, and yy, V2), 


= np(pyt+ Pr-+Py)” | = np; : 
i(Pyt+ Prt PE Pi for the pair (x;,y;) be denoted by 


m m 


Then y > fi 


i=) j=l 


E(x?) 


It 
, 
= 
* 
as) 
at 
3 
alt) 
) 
7 


is the total frequency. 


Op; “A x xg he. 
If X and Y are random variates 
0 é ! 
= Pj ap Imp, a als = 7 Pi" Py uate rt} then the function p S.t. 
At XN boa 
PUM; 
te} é ; ee 1: 
= Dp; a {p a (p, + Po tpt is called the joint probability funct: 
Pi Pi Definitions 
) “ (1) Let Px (Xj 
= Pia (Pi (Py + Pa-+PE)” ') 
Pi and Py; 


= p; {n( py + Po. +Py ay t+n(n—1)p;(pyt..-+ Py yy Py (or py) is called margin: 


= p; {n+n(n—l)p,} = np; +n(n-l)p;’. (2) pe (ly, 
it Nj 


; n} ' 
E(x;x:) = fp aa Se apy NN py RD x ; 
" (x);) Dow x! . Baer Pk is called conditional probabi 


Similarly Prix (oj! Xi 


1s called conditional probability 
(3) X and Y are said to be inde 


E 
Otherwise they are said to be 


pip, (p+ y" 
= Pi) Pj (Pi t+ Poe 
ap, ‘ep, 11 P2--++Pk 


= n(n—1)p;p;- 
Q00 
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a 
wiv Pr 
Phot Pe 
sone Pit 
ie rene ae 
u Xp 2X4 
eee xy! mie 
nr| 
) 


(n-l)p;(pyt...+ pz)" 7} 


1-1)p;’. 


1 


Bivariate Distribution 


11.1. Discrete Bivariate Distributions 


In the case of discrete bivariate distribution there are two discrete variates X,Y 
and the pairs of values of X and Y are considered. The frequencies or probabilities are 
for the pairs. 

Let x), %2-.-X» and y), y2...y, be the values of and Y respectively and the frequericy 


for the pair (;, y;) be denoted by fiz: 


Then Di = WN (say) 


is the total frequency. 

If. X and Y are random variates, and probability for the pair (x;,¥;) is denoted by Pj 
then the function p S.t. 

P(X Yj) = Py 

is called the joint probability function of X and Y. 
Definitions 

(1) Let Px) = Pi t+ Piat--+Pin 
and Py(¥j) = Pry + Pajte+Pmj 


py (or py) is called marginal probability function of X (or Y) 


Pi 
(2) Pxiy(%/y;) = — 
Py(y;) 
is called conditional probability function of X given y = y; 
Pa Pij 
Similarly Pyix(yj/x)) = —— 
Px (%) 


1s called conditional probability function of Y given X = x; 
(3) X and Y are said to be independent if 


Py = Px (x;)x Py(y;) 
Otherwise they are said to be dependent. 
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Joint discrete density function 


The joint discrete density function fy y(‘.) is defined by 


fy (y) = {P{X=x,Y=y} fora value pair 


(x, y) of (X,Y) 
0 (otherwise) 


Marginal discrete density functions 
Marginal discrete density functions are defined by 


Fx (4%) = bie (xz+y;) 
7 


fr Ox) = yy (Xj Ye) 


Joint Cumulative Distribution Function : 
Def : Let X, Ybe two random variables both defined on the same probability space. The 
joint cumulative distribution function of X, Y is denoted by Fy y(‘“) and is defined as 


Fy y(x,y) = P(X S$ x;¥ < y) for all value pairs (x, y). 
Properties of Cumulative Distribution Function 
cdf. Fy y(-') satisfies following properties : 


(1) lim Fyy(y) = Fyy(-«,y) =0 for all y 
X00 
lim Fy y (x,y) = Fy y(x,-0) =0 for all x 
yo , 


lim Fy y(x,y) = Fy y(%,0) =1 
x70 


yoo 
U1) O<SFyy(xy) <1 


(IIT) Fyy (x,y) is monotonically non-decreasing 

(i) Fyy(y,y)2 Fyy (4, y) if x, 2x, 

(ii) Fyy Qy1)2 FyyQuy2) if y, 2 y2 
To prove (i) we observe that 

P(XSx,Y¥ Sy) = POX <x,,¥ Sy)+ P(xy < XS x), <y) 

= Fy y(X%.y) = Fy y(xa,y)+ Pry <X S4,,Y Sy) 
> Fyy (%y) > Fy y (x,y) 
Similarly (ii) can be proved. 
(IV) If x, <x, and y, <y, then 


P{x, <X Sx2,y, <Y¥ S yp) 
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= Fy, 
-F, 

To prove this define 
A, =(X $x), A, = (X 
B, =(Y Sy), By = (Y: 
Then L.H.S. = P{{ 
= Pt{r 
= Pi 
= PL 
+P 
= Fy 
Marginal Cumulative Distril 


Marginal cumulative distr 
F X (x) = F xX 


Fy(y) = Fy 


Result : Fy(x)+ Fy(y)- 


Proof : Define events 4 a 


Then P(A) 


PCB) 
By additive law 
P(AU B) = P( 


P(AMB) > P\ 


P( 
P( 


WW 


> Fy 
AN 
AN 


By (x) 
Sometimes, same small ] 
Ex. 11-1. x and y are twe 


f(y) = ; 


where x and y can assume on 
of y for given x. 
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efined by 


=y} fora value pair 
(x, y) of (X,Y) 
herwise) 


by 


y;) 


yx) 


ied on the same probability space. The 
noted by Fyy(-.’) and is defined as 


). 


:Q for all y 


0 for all x 


y) + P(x, <X <x,,¥<y) 
(x2 <XSx,,V¥ sy) 


Sy) 
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= Fy y(%25¥2)— Fy y 20M) 
—Fy y(%.¥2) + Fy yy) 
To prove this define 
A, =(X $1), 4, = (X $xp) 
B, =(¥ Sy), By = YS yz) 
Then LHS. = P{(A, -A,) A(B, -B,)} 
= P{A, \(B,-B,)}- P{(A; A(Bz - B,)} 
= P{A, By}— P(A) 0 B,)— P(A, 0B) + P(A; OB) 
= PIX $x,,¥ Syp)—P(X $ x9, ¥ S$ y)— P(X Sms ¥ S yp) 
+P(X <x3Y Sy) . 
= Fy y(X%2.¥2)— Fy y X21) — Fey 1 ¥2) + Fey rd 


Marginal Cumulative Distribution Function 
Marginal cumulative distribution functions of X, Y are defined by 


Fy (x) = Fy y(%,%) 
Fy(y) = Fy y(®, y) 
Result : Fy(x)+Fy(y)-lS Fy yQuy) Ss VF y(x)Fy(y) for all x, y. 


Proof : Define events A and B such that 
A:X<x and B:Y<y. 


Then P(A) = P(X $< x)= Fy(x) 
P(B) = P(Y Ss y)= Fy(y) 
By additive law 
P(AU B) = P(A)+ P(B)- P(ANB)S1 


P(AQ B) > P(A)+P(B)-1 
=> Fy(x)+Fy(y)-1s P(ANB) = Fy y(%y) 
AQBCA= P(ANB)§ P(A) = Fy (x) 
ANBCB=> P(ANB)§ P(B)= Fy(y) 
2 {P(AMB)}” $ Fy() Fy) 


=> P(AMB)< VF x(x) Fy(y) 


1 Ey (0) + Fy) -1S Fyy (ny) Sf Fx OF): 


Sometimes, same small letters are used to denote variates as well as their values. 
Ex. 11-1. x and y are two random variables having the joint density function 


flay) = sy (e429) 


where x and y can assume only the integer values 0, 1, 2. Find the conditional distribution 
of y for given x. 


428 : MATHEMATICAL STATISTICS 


Sol. By given 


1 ; 
f(x,y) = a +2y),x=0,1,2; y =0,1,2. The table below gives various values of f 


yo 
Xx ob 0 l 2 te 
0 0 A as & 
27 27 7 
= 2: & = 
2 a ca 6 12 


The last column headed f, gives the marginal probability function of x. 


The table giving the values of conditional probability function of y for given x. 


a 0 1 2 
1 2 

0 0 5 ; 
1 3 5 

1 Ey 5 nl 
2 4 6 

? 1D 2 rr) 


This is obtained by dividing each row by corresponding entry under ,. 
Ex. 11-2. Two unbiased dice are tossed simultaneously. If x and y be the numbers on 


two dice respectively, find 
(@) P{x+y=6|y=2}. (ii) P(x~y =2). 


1 
Sol. Both x and y can take values 1, 2, 3, 4, 5, 6 each with probability Ps 


The joint probability function of x and y is given by 


11.1 
sy) = >.> =— Vxand Vy. 
P(x, y) 66 se y 
The table listing the values of p is 
y-> 
Pa 1 2 3 4 on 
nn ee 
a, er ee: eee eee | 
36 36 36 36 36 
A ge a ca 
36 36 36 36 36 
; 4 s& ff, a 4 
36 36 36 36 36 


36 

i 

36 

i 

36 
(Contd.) 
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1 a 
36 EY 
: 36 3 
| J 
6 36 34 
ae 
fy 6 ¢ 


The last row gives the probabi 


@ .  P(xty = Oy=2 
ples 
“py 
si) 
~ 36/ 6 
(ii) P(x-y=2) = Pix=: 
+P(x = 
1 
36 3 
Ex. 11-3. The joint probability 
by the following table : 
x7 
yt 
1 
2 


Find (i) the marginal distrib 
(ii) the conditional dist 

iii) Pix+y $3}. 

- Sol. The given table is 


(i) The marginal distributio 


distribution of x is given by row h 


(ii) The conditional distrib 


corresponding entry in column he. 
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i a A ill i Es : 
> below gives various values of f 20 ~ ~ i = a 
. i Es - ze = : 
2 ty 36 36 36 36 36 36 
4 6 6 ~ 36 x 7 36 = 
7 7 36 36 36 36 36 36 
3 9 Py “ , 7 5 6 6 
7 27 : : ° ° ° : 
6 12 The last: row gives the probability function of y 
bility function of x. p{x =4, y =2} 
function of y for given x. a pty) 
y 
slg het 
36/6 6 


i 


P(x =3,y =) +P(x =4, y =2) 
+P(x =5,y = 3) P(x =6,y = 4) 


(ii) P(x-y=2) 


bo 5 Ee. 


—+—+— : 
36 36 «636 «636 «(9 


' 
: Ex. 11-3. The joint probability distribution of a pair (x, y) of random variables is given 


1g entry under f,. by the following table : 
ly. Ifx and y be the numbers on x—> 
yl 1 2 3 
1 0-1 0-1 0-2 
' 2 0-2 0:3 0-1 
vith probability e | Find (i) the marginal distributions 


(ii) the conditional distribution of x given y = 1 
ait) P{x+y <3}. 
y: ; - Sol. The given table is 


5 6 
. i 1 
) 36 - 36 ‘ 
- aan i (i) The marginal distribution of y is given by column headed Py and the marginal 
) 36 36 : 

\ distribution of x is given by row headed p,. 
1 : 
36 = ; (ii) The conditional distribution of x for y = 1 is given by dividing first row by 


(Contd.) corresponding entry in column headed Py- 
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.. Conditional distribution of x for y =1 is 


1 1 41 
P(x/y=)) a. ie a 
(iii) P(xt+y $3) = PUxt+y=2)+P(xt+y =3) 
= P(x=ly=)l)+P(x=1,y =2)+ P(x =2,y=1) 
= 0-14+0-24+0-1 
= 0-4, 


Ex. 11-4. Two discrete random variables X and Y have 


p(0,0) = 2 (04) = 50) = + 5 PUL) => 


Test whether X and Y are Gite 
Sol. The given data can be put in the form of table below 


Since p(x., y) # py(x).py(y), X and Y are not independent. 


Ex. 11-5. Two tetrahedron (regular four-sided polyhedron each with sides labelled 1 to 
4) are thrown together. Let x denote the number on the downturned face of the first tetrahedron 
and y the larger of the downturned numbers. Find the joint discrete density function of x and 
y. Also find their joint cumulative distribution function. 
Sol. Possible value pairs for (x, y) are : 
(7,1), (,2), (1,3) ,4) 
(2,2), (2,3), (2, 4) 
(3,3), (3, 4) 
(4, 4) 
Consider pair (2, 2). 
Here no. 2 appears on first tetrahedron and on second tetrahedron nos. 1 and 2 both can 
appear. 
cauhors ea 
*. prob. of pair ( og a A Te 


Similarly for pair (3, 3), no. 3 appear on first tetrahedron and nos. 1,2, 3 can appear on 
second one. 


3 
.. prob. ir (3,3) =— 
prob. for pair (3, 3) 16 


4 
and prob. for pair (4, 4) = 16 
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ee 


Prob. for each of the othe 


.. Joint discrete density 1 


(x, y) > (1,1) (i, 
1 a 
FryGry) i6 i 
(x, y) : 3.3) G, 
3 1 
Fry Q5y) ié ‘i 


o’ in another tabular forn 


f(x) 


Table for joint distributic 


This is because 
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1 
Prob. for each of the other pairs is —. 


16 
1 
2 .. Joint discrete density function is : 
P(xt+y = 3) (x, y) : qd, 1) (1, 2) (1, 3) 1, 4) (2, 2) (2, 3) (2, 4) 
+ P(x=1,y =2)+ P(x =2, y=1) Pao ers i i i 2 i i 
a ann 16 16 16 16 16 16 
(x, y) : (3,3) (3, 4) . (4, 4) 
have , ‘c 4 
n(l,0) = 2, puiy=. Fry) + 76 16 16 
9 9 o’ in another tabular form is « 
: below 
Py 
3 
9 
6 
9 
ependent. 


f(x) 4/16 4/16 4/16 4/16 
hedron each with sides labelled 1 to 


vaturned face of the first tetrahedron 
nt discrete density function of x and 


l<x<2 2<x<3 3<x<4 


4) 


d tetrahedron nos. 1 and 2 bothcan 


dron and nos. 1,2, 3 can appear on 


This is because 


F(,1) = P(x<ly<t) 


1 
= P(ix=ly=)=— 
(x=lLy=1) 7? 
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F(2,3) = P(x <2, y <3) 
Pixtl,y=)+P(x=1 y=2)+P(x=1, y =3) 
+P(x =2, y =1)+ P(x =2, y= 2)+ P(x =2, y =3) 


1 1 1 2 1 = 6 
+—+—+0+—7—4+-—-=— 

16 16 16 16 16 16 

{- P(x=2,y =) =Oas y> x} 


similarly others. 


Remark. Marginal densities can be obtained from the joint densities as shown in first 


table of last example. But converse is not true. 
Ex. 11-6. Let x, y be jointly discrete random variables such that each x and y have at 
most two values. Prove or disprove : x and y are independent iff they are uncorrelated. 
Sol. Let each of the variates x, y take values 0 and 1. 


let {yy =0 

=> ~ Cov (x, y) =0 

= E(xy) = E(x) Ey) 

> 0.0. P(x= 00 y=0)+10 Px=1N y=0) 


40.1. P(x =ONy=)D4+ll Px=lay=l) 
= {1 P(x =1)+0.P(x=0)}. {1 P(y =1) +2. P(y =0)} 
=> P(x=1 A y=) = P(x=)).P(y=1) 


=> x and y are independent. 
Converse. Let x dnd y by independent 


Then Cov (x, y) = E(xy)- E(x)E(y) 
= E(x).E(y)- E(x)E(y) 
=0 
fy =0. 


11.2. Continuous Bivariate Distributions 


In the case of continuous bivariate distribution, variates Y and Y are continuous. Here 
various terms are defined as below : 


~ (1) Probability Density Function. A continuous function fy y (‘.")s.t. the probability 


~ dx dx 
of the value of the variate to lie in infinitestimal intervals eee and 


dy ay 
E med + 4 can be expressed in the form fy y(x, y)dx.dy, is called probability density 


function or simply the density function. 


The density function fy y(.") has the following properties : 
(i) fy yy») 2 0, Vx and Vy. 


Gi) [f fr sy) dedy = 1 
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where the integral is extended ove 
(2) Probability Differential 
fyy(%y)dxdy is called pro 


b 
Pla<x<b,c<y<d]= i 


x=a 


(3) Marginal Distributions 


Let fy! 


and fy 


Then, fy(x) [or fy(y)] 1s c 
fy(x)dx is called marginal c 


ory. 
(4) Conditional Density Fu 


Fyiy(® 


is called conditional density fun 


Similarly, fyix 


is called conditional density fun 
(5) Two variates X and Y ar 
fxr 

(6) Joint Probability Dist: 

It is denoted by Fyy (xy) 


Fy y¢ 


It is also called cumulat 
properties : 


(i) Fy y(-— 

(ii Fy y( 
a 

(iii) 3 


(iv) Fyy(,y) is monote 
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+ P(x =1, y =3) 
2)+ P(x =2, y = 3) 


ao 
16 


joint densities as shown in first 


' such that each x and y have at 
ont iff they are uncorrelated. 


0) 
l) 
- 2. P(y =0)} 


y) 


s X and Y are continuous. Here 
ion fy y (.)s.t. the probabiiity 


dx — dx 
tervals |*~>->* Tia and 
dy, is called probability density 


ties : 
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where the integral is extended over the entire range of (x, y). p.d.fis also denoted by f(.’) 
(2) Probability Differential 
fx.y(x,y) dx dy is called probability differential. Moreover 


bod 
Pla <x<b,c <y<d]= J [ fer yrde dy 


xX=a V=C 


(3) Marginal Distributions and Marginal Density Functions 


Let fy) = | fern yay 


—-o 


and fy(y) = aes y)dx. 


Then, fy(x) [or fy(y)] is called marginal function X (or Y). 

fy(x)dx is called marginal distribution of X and f,(y)dy is called marginal distribution 
ory. 

(4) Conditional Density Function 


fyy (ay) 
ly) = — 
Fan(aty) fy) 


is called conditional density function of X given Y 
Fxyy) 
fy (X) 


is called conditional density function of Y given X. 
(5) Two variates X and Y are said to be independent (or stochastically independent) if 


fxyQoy) = fx). fy). 
(6) Joint Probability Distribution Function 


Similarly, frixQy/ x) = 


It is denoted by Fy y(x, y) and is given by 


Fi x y 
Fyy(ay) = J | fey(syddedy 


—-2 —00 
It is also called cumulative distribution function and possesses the following 
properties : 


(i) Fy y(-%,y) = 0= Fy y(x,-0) 

(ii Fy y(%, 0) = J JAG dx dy =1 
ea OF x y 

(iii) axdy fxy sy) 


(iv) Fyy(%.y) is monotonic non-decreasing function. 
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Remark : The conditional density functions 
Frix(y/ x) (and fyjy(x/y) 
are undefined for fx(x)=0 (and fy (y) =0 


Conditional cumulative distribution function 
These functions are defined as Zollows : 


Bf 
Fux (yl) = | fix (Ix) dy 


“OO 


for all x s.t fy(x)>0. 


and ; Fyjy(x/y) = | few (x/ y) dx 


for all ys.t f(y) >0. 


Remark : Obviously f,,y(y/x) is non-negative and 


ffx lady 2 fhe) , 
ma = fx) 


1 3 
= Fey [fer orey 


= fx (x) 
Fx (x) 
Ex. 11-7. Let fy(x) and f,(y) be two probability density functions with corresponding 


cumulative distribution functions Fy (x) and Fy(y) respectively. Define 
Fxy ny) = fy (x) fy (y) {1 + a2 Fy (x) - 1][2F,(y)-1]}, -1<as<1 
Show that : (i) fy y (x,y; &) is a joint probability density function 


(ii) the marginals of fy y(x,y;a) are fy (x) and fy(y) respectively, 
Sol. We have 


O< Fy(x)<1 
> 0<2F,(x)<2 
=> -1<2 Fy(x)-1<1 
Similarly -1<2F,(x)-1<1 

also -l<a<l 


“1S. {2 Fy(x)-1} {2F,Q)-1} <1 


> O<1+a{2Fy(x)-1} {2 Fy(y)-1} <2 


MATHEMATICAL STATISTICS 
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=> fyy(%ya)20 as fy(x); 


Also 1= f fare 


| 
ma 
~_~ 
os 
— 
—, 
Mine, 


ll 
_——, 
=> 
~- 

id 
— 
—— 


Let I 


1 = fxQ) 


Similarly f ferley: a) dx = 


—O 


oOo 0 


J [ far(aysarae 


=D — OO 


fxy(%, ys) is ajo 
Ex. 11-8. For the bivariate dis: 


f(xy) 
find Fy y (x,y); tx (x), fy), frix( 
Sol. K is given by 
1 pl 
i= [Ke 
1 
- Kf ae: 
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inctions with corresponding 


y. Define 
anction 


ind f(y) respectively. 
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Gat 


=> fxy(x%yia)20 as f(x); fy(y) are non-negative. 


= | fers. a)dy 


Also T= 
= f fed HOM +0127, (x) 1} {2FyO)- Lay 
= fx) | froddy + af, 2F)-I} | FO 2AO)- Hay 
Let 1, = | fro 2FyO)- Thay 
Put Fy(y)=U 
d Fy(y)=dU 


=> fy(y)dy=dU 


1 
_ [eu-nau ={u? -u} =0 
0 
bef © JfrOday=I 


Similarly | feroy a) dx = fy(y) 


oO 


om 0 co 


| [trGa addy = f foodr=1 


fy y(x,y;@) is a joint p.d.f with marginals f,(x) and f,(y) respectively. 
Ex. 11-8. For the bivariate distribution given by 
f(xy) = K(x+y) Ton Toy) 
find Fy y(x.y)s fy), Fy )s frrx 9! )s Fry & Is frix y !). 
Sol. K is given by ; 


l= [ [kt aedy 


= Kfte{x i 
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ha ae 
=r{det} 
Fyy(x,y) = if [ (s+ y)dvdy| Loa (*) Lon) 
1 Xx 
+ if dy| + y)dsayh Koay) Hay) 


1 y 
+ if dx (x+ vay| To,00) ) Loy) 


+ l1,0) (x) L410) (y) 


1 
= sles + xy’) Loy) (*) Toy) eae X) [9,1 (%) Lq,0)(¥) 


Hy +9? )Ie0y(X) lon (y)+ T1,20)(X) Kae) 


1 
fx (x) = {fc vv| Foy (x) = [« 3] Foy) 


1 : 
fr(y) = {{her+> aa tono)= (943 |fan() 


fy y (Gy) 


i= 
frix(y/ x) f(x) 


(x+y) Loy) Loon) 


1 
[« + +) Loy) 


xty 
ea Lon) 


Ul 


Sumilarly 


x+ 
fyiy (aly) = ae Toy) 


y | y? 
Fyjx(y/x) = I (v= Tits (for 0< y <1). 
xt 
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Ex. i1-9. If F (-) is a cumulativ. 
(i) Is F(x,y) = F(x) + FQ) a, 
(ii) Is F(x,y)= F(X)F(Y) ajoi 
Sol. (i) Is not true because 
F(0, 00) 


. (ii) It is true because 

(a) F(00,00) = F(00)F (0) = 

(b) F(-0,-y) = F(-«)F(- 
F(x,-00) = F(x) F(-00) 
OF _ OF(x) OF(y) 
Ox Oy Ox = Oy 

= f(x) g(y) 


(d) F(x,y) is monotonic n 
Ex. 11-10. Jfx and y are two ra, 


(c) 


1] 
fmy) = Fl 


Find (i) p{xslLy<3}. 
i) plxty <3} 
(iii) p{x<1/y <3}. 
Solution. 


(i) P{xsLy<3} 


(ii) p(x+y <3) 
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Ex. i1-9. If F (-) is a cumulative distribution f": - 

(i) Is F (x, y) = F(x)+F() a joint cumulative distribution f ser? 
(ii) Is F(x,y) = F(x)F(y) a joint cumulative distribution f” ? 
Sol. (i) Is not true because 


lon (y) F(c0, 0) = F(«) + F(00) 
=1+1=2#1 
(x) la2)(Y) . (ii) It is true because 
(a) F(0,00) = F(o)F (0) =11=1 
(b) F(-00,-y) = F(-0)F(-y) =0 
x) Loy) 


F(x,-0) = F(x)F(-«) = 0 


OF — OF(x) OF(y) 
Ox Oy ox = dy 


= f(x) g(y) 


(d) F(x,y) is monotonic non-decreasing as F(x), F(y) are so. 


(c) 


yt (x? +X) Lg) (x) T,c0)(Y) 


(1,20) (%) laay(0)] ae? 
Ex. 11-10. /fx and y are two random variables having joint density function 


LAW nln | 
9) (Oud f(x,y) = 5 (6-x-y),0<x<2,2<y<4 


Find (i) p{x <1, y <3}. 


(ii) p{lxt+y <3} 
(iii) p{x<1/y <3}. 


1 
ll Lon) 


Solution. 
1 1 3 
@ Pixshy<3}= =f [O@-x-ydae 
x=0 y=2 
1 1 3 
= «| ex] @-2-y)dy 
0 2 
1] 5 
= = | de 6-x-— 
8. 2 
oa ae ae ee 
8 2 2 8 
“ \ 1 3-x 
> (for 0<y <b), ) psty<3)= =f [@-x-yardy 
x=0 y=2 
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(iii) 


P{x <1ily <3} 


P{x<1/y <3) 


Ply <3) 


P(x <l,y <3) 
P{y <3} 


ll 
ca 


3 
[=(-x-yas dy 
2 


tl 

Al 
Ww 
~ 


5/8 


Ex. 11-11. Let x and y have the joint density function 


flx.y)=5.05y 5x52 
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Find.the marginal and conditional probability density functions. Are x and y 


independent ? 
Solution. 


(i) 


2 


x=0 y=0 


| [ronan dy 


fx) 


2 
2 x dx 
2 

0 


=1 
x 


[fray 


ysO 


ll 
| 
my 
Py 
— 
ll 
| 
= 


fy) = | flsyrdx 
: 
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(ii) fal 
fy! 
(iii) Now, L.OOFy( 


.. x and y are not independe 
Ex. 11-12. The joint density, 


fxy (x, 


Determine marginal distribu. 
Solution. 


i {tee (x, y)dx. 


x=0 y-0 
(i) Fx ( 
fy( 
(ii) Now Fy OSFy ( 


.. X and Y are stochastically 
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-y)dx dy 


on 
£2 


vy density functions. Are x and y 
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f(x,y) i 
; fc oc = oN: 
(ii) f(x/y) FO) 225 
_ fy) _ 1 
f/x) = FG s 
(iii) Now, LOAFO) = seo) 


1 
ee —y)# f(x,y) 


“. xand y are not independent. 
Ex. 11-12. The joint density function of a bivariate distribution is given as below : 


fyy(y) = z(tty),0S4<2,0<y <1 


iT 


0 elsewhere 
Determine marginal distributions and show that X and Y are stochastically dependent. 
Solution. 


f [ter (x, y)dx dy 


2-1 
x Jaxfca + y)dy 
x=0 y-0 0 0 


1 
1 
(i f(a) = 3] ore 


2 
1 2 
fry) = 5 Jt =S04y) 
0 


2 1 
(ii) Now fx Of OQ) = {+3 |uenrs fran) 


.. X and Y are stochastically dependent. 


ae aa BIVARIATE DISTRIBUTION 


Find c and marginal; condition 


Ex. 11-13. The joint density function of a bivariate distribution is given by 
| Sol. c is given by 


f(x,y) = 4xy ef 4Y), x >0, y20. 
Find the marginal and conditional probability density functions. Are «und y 
independent ? 


Solution. 
ff pos dedy 24 f faye? ae ay 
00 x=00 
= af dx.xe" | ye” dy 
0 0 
set ey | aa 
0 0 
~(x7+y?) 
@ f(x) = Jame ray 
y=0 
co ¢) 
= 4xe" ye” dy 
y=0 
= Qxe* Le” | = 2xe* 
f,0) kd fame rae (i) ° f,¢ 
x=0 
7 2ye™” 
cy fOny) _ daye 
(ii) fyGly) = = 
ad fy) 2ye” 
= 2xe* 
Axye™ +") 2 
fix (y/ x) = ——#§_— =2ye” 
2xe"* 
‘ a pletacde 
(ii) Fay) = Aaye 
= f.O AQ) 
. x and y are independent. f( 


Ex. 11-14. The joint density function of a bivariate distribution is given by 
f(x,y) =esin portyn0<x<10<y <I 
=O elsewhere. , 
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stribution is given b : . es a . : : 
as & y Find c and marginal, conditional probability density functions. Arex and y independent ? 


yz 0. Sol. c is given by 


'ensity functions. Are a und y 


tl 


[ [roneray 
x=0 0 


1 


1 
—C [ax] sin £4 ydy 
x=0 0 


1 
2 Tt 
= ‘J an -2 cos ae + »} 


2e 
Tr 


dx dy 


1 


0 


1 
} cos * ¢~cos Za+x) dx 
‘ 2 2 


4c : 


nm 


sin = sin Fa4x) 
2 2 0 


2 
2xe* 


1 
(i : fx) = { fleyddy 
0 


1 
= cf sin S (x+y) dy 
+y’) 0 


1 


ri — COS 8 (x+y) 


0 


a COs = y-cos = (+x) 
4 2 2 


T us . 
= — 4cos—x+sSiIn —xX 
* 2 2 


1 ae ee 
nm ft 
stribution is given by fyO) = J fexsdde 7 re a y) dx 


d<y<l T T mt 
= —4cos — y—cos —(1+ 
«| 5 ~~ »} 
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a cos = +sin~ 
a oe oe. 


in = (x+y) 
flxy) _ e.sin > (rty 
fx) m {eo 7 xtsin x | 
2 2 


(ii) Fix (y/ x) 


. 
sin — + 
5 y) 


Tt ee: 
cos —x+sin —x 
2 2 


_ oT 
sin — (x+y). 
Fesy(x1y) 2 


cos +sin = 
ae 4 


fig Now fy) # ODA OY) 
“x and y are not independent. 

Ex. 11-15. Show that the conditions for the function 

f(x,y) =k exp {ax? + 2hxy + by*}, -0 <x,y <0 
be a density function are 

()as0 (ii)b<0  (iii)ab-h* 20. 

Assuming these conditions to be satisfied, find k 

Sol. Let f(x,y) bea density function. Then 


oo] 


J [ rayraray=1 


xX=-0 y=-oo 


oO fo] 


ee k J Jexp {ax? +2hay + by} dx dy =1 


2 _ 2 
i f(s) (2 _ }p pita oad 
a . a 
2 _ p2 : 
7 l(>+%) (oe ae 


. The integral in (1) converges if 


a<0, b<0, ab-h*? >0 


..(1) 
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Assume a <0,b <0, ab-h* >0 
Let ‘a=-\,b=-p,h=n. 
Then 2%>0,p>0 and ab-h? 


ax? +2hxy + by : 
d= 
ra) oO os Z 
k } J exp a er 7 


X= y=-0 


ie‘a} » A 2: 
Le, L=k } oof: or y 


yo! 


~ 2 
Now jes - ween 


xX=—00 
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; 
a—(4+ 
ae y) 


ae \ 
—x+sin —x 
2 2 


o<x,y <00 


kx dy=1 


— fy? 

al: 
=z |y* 7. if a #0 and 
a 


-h? : 
se hiroec 
? 


(1) 
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Assume a<0,b<0,ab-h? >0 


Let a=-A,b=—-p,h= 


Then A>O,p>O0 and ab—-h? 


ax” +2hxy +by? = als-2, 


> 


vac 


X=—-O y=—-0 


Ul 


=Ap-n? >,0 


2 2 
) et | 
A 
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_ ee ant y?. 
(hn) 2S dey 
ptf 
( am) “la vat fexo(-t Jas where u = xd - ny 


where u” = At 
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11.3. Joint Moment Generating Function and Moments 


Def. 1. For random variates x and y, the joint moments about point (a, b) is defined 
to be 


E{(x-a)'(y-by'} 

where /,5 are zero or any positive integers. It is denoted by u,« (a, b). 

If a= x,b=y joint moment is denoted by 1,,. Thus 

Bry = E{(x-X)"(y-y)"} 

Def. 2. For random variates x and y the joint moment generating function is defined by 

M gy (tysto) = E{eh-9+20-b)) 
a, a 

where M, 4(t,,t2) denotes m.g.f- about point (a, b). 


Here it is assumed that ¢,,t, are real and expectation exists for all values of t;,t, such 


that -h<t,,t, <h forsome ;>0Q. 
The relation between joint moments and joint m.gf. is 


ian 
ep I ih) 
Hrs = Oty Ot; aaa 


4=0=t, 


te 5s : 
and y/, is the co-eff. of = = in the expansion of M, (t,t). 
ris! 


In general, if x), x2,......x,, aren variates, joint moment about point (a,,a)........ a 
defined by 
Mr ry ur, (4j425---4,) = Bf (x, — ay)" (x2 - a)” ...(x, —a,)} 
and joint moment generating f” about point (a), @3,....... a,) is defined by 


re -a;) 


Mea ty smi, Eyota rovers t,) = Eye 


In order to obtain H;,,r,,..,, "-8-/- is differentiated r, times w.r.t.t,,r, times w.r.t. ty 


and so on and then limit is taken as all ¢’s approach 0. 


" 1? Tn 
Jy es : é 

or co-eff. ~~ ~~ ---"|_ is taken in the expansion of m.g.f. 
nln! x! 


Remark : (1) Marginal moment generating functions can be obtained from joint m.g.f. 
as below : , 


M(t) = Myg(t),0) = Lim Ma ota) 


M(t) = M,y(0,t2) Hee Ma,y(t ta) 
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(2) The independence of two 
the help of m.g.fs. The result is : 
Two jointly distributed randc 


M yy (t >! 

where M,,,(4),%2) denotes joint 

The proof of this result is be 
11.4. Conditional Expectation 


Consider a bivariate distribu 


of x and y. 
Conditional expectation of ‘; 


and 


It is denoted by 


Similarly condition expectat 
Theorem : 11.4.1. El gt 


Proof: 
Let ‘ Lh 
Then E[E{g(y)/ 
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nents 


oments about point (a, b) is defined 


‘7 
ated by w,."(a, b). 


Thus 


ent generating function is defined by 
t(y-b) } 


ion exists for all values of t,,t, such 


ff Is 


i) 
1y=0=1, 


1,4t1t2). 


ment about point (a, ,a)........ a,) is 


ty —Az)”...(x, -a,)™} 


Ag yeessees a,) is defined by 


n times w.r.t.t),ry times w.r.t. ty 


gf. 

ns can be obtained from joint m.g.f. 
) 

) 
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(2) The independence of two jointly distributed variates x and y can also be tested with 


the help of m.g.fs. The result is : 
Two jointly distributed random variates x and y are independent if and only if 


Myay(tsty) = My(t)).My(t) 


where M,,,(t), 4) denotes joint m.g.f about (0, 0) etc. 
The proof of this result is beyond the scope of this book. 
11.4. Conditional Expectation 


445 


Consider a bivariate distribution with variates x and y. Let g(x, y) be a continuous f" 


of x and y. 
Conditional expectation of ‘g’ given x is defined to be 
> 8%.y PCy; 9 
j . 
for discrete distribution 
and ; 
Jae foray. 
for.continuous dist. 
It is denoted by 
EX{g(x, y)/ x} 
Similarly condition expectations given y can be defined. 
Theorem: 11.4.1. Elg(y)} = E[E{g(y)/x}] ..(1) 
Proof : 
Let _ h(x) = E{g(y)/ x) 
B Ley, jl WO 
Then ELE{g(y)/ x}] = El(a(x)] 
— dh) P(x) 
= ys Y 8(9)PO; x) Px (%) 
i j 
(using (Z)) 


_ YY 80) P12) Py OH) 
j 


! 


_ 80s) Pi 
i 


i 


E{g(y)} 


...(2) 
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Remark : (7) Similarly as above it can be shown that 


ELE{g(x)/ y}] = E{g(x)}-. ...(3) => El var(y/x)| 
(a) If g(y) =ythen(1) => 
and if Ta ‘ : ae a Also var| E(y/ x)| 
ELE(x/y)] = E(x) (5) 
Result 11.4.2. If g,,2, are f”’s of one variable then ee 


E{gy(y) + 82(y)/ x} = Efg,(y)/ x}+ E{g,(y)/ x) 


E{g,(y)a2(x)/ x) = 92(x)E{g,(y)/ x} 


Similar results hold when y is given. 
Ex. 11.4.3. Result : Conditional mean coincides with unconditional mean if and only 


E|var(y / x)| + var| E(y / x)| 


Ex. 11-16. For the bivariate dis 


if the variates are independent. f(x, y)= 
Sol. Let x and y be the variates with joint density f” f(x, y). find the conditional mean and varia 
Sol. We have 
Now E(y/x) = fy fue dy , 
“ J yl gy 
F,() 
Also E(y) = J y fy(y)dy 
Where the integrals are extended over the respective range 
“. E(y/x) and E(y) coincides iff. 
f(%y) 
| f(x) oy fyQ) => Yo 
ie., fay) = £OFO) FY) 


= x and y are independent. Marginal density f” of y is gi 
11.5. Conditional Variance 

Conditional Variance of y given x is denoted by fy 
var[y / x] 


and is defined by 


var {y/x} Ely-BO/9P /x| 


Ely? 29872) +{E0/ 9) to} 


= Ely" /x)~(E(y/ ...(1) 
- Similarly conditional variance of x given y is defined. 

Theorem 11.5.1. var (y) = Elvar(y/x)]+ var [E(y/x)] 

Proof : By (1) we have 


var(y/x) = E(y?/x)-{E(y/x)}° 


.. Conditional density f" of) 


Friy (ely. 
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..(3) 
(4) 


(5) 


l 


(y)/x) 


unconditional mean if and only 


(x,y). 


ange 


| 


+{EQ/ 9 }/ b9} 
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= Elvar(y/x)] = E[E(?/9]- {EQ oF 
= E(y*)- B{E(y/ x) (2) 
Also varlE(y/8)] = E{E(y/ OP [EE oy 
= ELE(y/ 9} -{EQ)} 3) 


using (4) of 11.4.1. 
Adding (2) and (3) 


E|var(y/ x)|+ var[ E(y/x)| = E(y*)-{E(Q)}” 


var (y). 
Ex. 11-16. For the bivariate distribution 


f(x, y)=yox’y?, = O<x<y<l, 


find the conditional mean and variance of x for given y. 


Sol. We have 
1= J J es ndedy 
1 sy 
{9G far[eyrax 
y=0 0 
, 1 
0 6 Yo 
a dy = — 
~ 7D Jy'dy= 3 
y=0 
=> Yo = 21 


f(xy) = 21x*y3 O<x<y<l. 


Marginal density f” of y is given by 


y 
f,0) = J fouy de 
0 


i 


y 
aly? | x%dx 
0 


= Ty® 


.. Conditional density f” ofx for given y 1s given by . 


f(x, y) 
wy(/y) = 
aes f,() 


y? 
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a a tat eb a ch 


. Conditional mean of x for given y is given by 


3x? 
E(x/y) = Jaa 
_ 2 
3 
r 2 3x 
Also E(x? /y) = fx ae 
Le 
32 


var(x/y) = E(x? /y)—{E(x/y)}? 


Pe eae 
Bo 46> 
3 4 

= 0° * 


Ex. 11-17. Three onbiased coins are tossed. X denotes the number of heads on the first 
two and Y denote the number of heads on the last two. Find : 


(i) -the joint distribution of X and Y 
(ii) E{Y/X = 
(iii) Pxy 
(iv) Give a joint distribution that is not the joint distribution given in part (i) yet has 
_ the same marginal distributions as the joint distribution given in (i). 
Sol. Different possibilities are : 


HHH, HHT, HTH, HTT 
THH, THT, TTH, TTT 


Possible values of X,Y each are 0, 1, 2. 
.. Joint dist. is 
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E{y/X =1} 


E(XY) 


E(X) 
E(X?) 
var (x 3 


Similarly | var (Y) 


cov (X,Y° 


Px 


(iv) Reqd. joint distribution is 
fyyQoy 


.. Joint dist. is : 
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ly)? 


tes the number of heads on the first 
‘ind : 


stribution given in part (i) yet has 


given in (i). 
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Similarly 


.. Joint dist. is : 


ae 
E{Y/X =1} = (5 


l 
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1 1 2 1 
== .090+=1-040:2-04—0.14 2144-24 
EDS 8 8 8 8 


40-0-24+4. 

8 

Din 2 2 

= —+—4+—+4+ 

8 8 8 

2 4 

=0—-+1—+ 
BOO) 8 8 
E(X?) = Ofer 
3 1 

xX = —-|]=-. 
var (X) > , 

var (Y) = 4% 


1.244.2.2 
8 


cov (X,Y) = E(XY)- E(X)E(Y) 


5 
rae 


cov (x, y) 


fa. 


1 
Pay = 4 var(x).var(y) 7 A “3 


(iv) Reqd. joint distribution is given by 


fxyvy) = Ak OAO) 


Lie 
8 8 64 


N}] colt 


oo | 

Ml 
fon 
TS 


| + Rl 


2 
8 
ee 
8 
2 
8 
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Ex. 11-18. Let a random variable X has a density function fy(-) and cumulative 


distdbution foncion Fy (-), mean wand s.d. 6. Let 


Y=a+ Bx, 


Then : 


-a<a<o, B>0 


(i) find a, suth that Y has zero mean and variance 1. 


(ii) what is the correlation coefficient between X and Y ? 


(iii) find cumulative distribution function of Y in terms of a,B and Fy(-) 


(iv) if X is symmetrically distributed about U, is Y necessarily symmetrically 


distributed about its mean ? 
Sol. (i) Y=a+BxX > E(Y)=a+f E(X) 


“ O=a+Bu>a=—-Bu. 


a 


(ii) cov(X,Y) 


var(Y) 


Pry 


(iii) c.df. of Y is given by 
Fy(y) 


(. E(y) = 0) 
= E(Y-Y)? 
= E{p?(x -X)’} 


= E{(X-X)y-Y)} 
= BE(X—X)° = Bo? 
= E(Y-Y)? 

= p°E(X-XY 

= p22 


= o 


cov (X,Y) _ Bo? 


Ox .Oy fet [B2o° 


_B ft ifp>o 
ipl \-l ifp<o 
= P(Y<y) 

= P(a+BX <y) 


= P{xs2<8] 
‘ B 
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(iv) We have : if Z is symm 
have the same distribution. 

«. X—w and -(X-p) hay 

Now Y -¥ =B(X -X) =P 

“ y—y and -(y -Y) als 

.. Y is symmetrically distr 

Ex. 11-19. Three fair coins 
two coins and Y denotes the nun 


Find: (i) The joint distribi 
(ii) Conditional disi 

(iii) cov (X,Y). 
Sol. Different possibilitiés < 


. Possible values X,Y are 
X =2,Y=0 
X=1,Y =0; 
.. Joint density function is 


Conditional dist. of y give 


PY = 0|X 


P(Y =1|X 


PY =2|X 
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ol 


(iv) We have : if Z is symmetrically distributed about constant c,Z—c and -(Z-c) 


'ty function fy(-) and cumulative 


B>.0 Seruee aa 
have the same distribution. 
; .. X—w and —(X —u) have the same distribution. 
nes Now Y-¥ =B(X-X)=B(X-p). 
and Y? “. yy and -(Y —Y) also have the same distribution. 


. Y is symmetrically distributed about its mean Y. 
Ex. 11-19. Three fair coins are tossed. Let X denote the number of heads on the first 
two coins and Y denotes the number of tails on the last two coins. 


Find: (i) The joint distribution of X and Y. 
(ii) Conditional distribution of Y given X=]. 
(iii) cov (X,Y). 
Sol. Different possibilities are 


HHH, HHT, HTH, HTT 
THH, THT, TTH, TTT 


“. Possible values X,Y are: 
X=2,Y =0;,;X =2,Y=1, X=1Y=1,X=1,Y=2 
X=1,Y=0;X =1,Y=1; X 40,Y =1;X =0,Y =2 
“. Joint density function is : 


erms of &,B and Fy(-) 


» is Y necessarily symmetrically 


“( E(y) = 0) 


2 
3? 8 
. 
8 
2 
8 
io? 
B-a° 
0 Conditional dist. of y given y =] is as below: 
<0 fx.y (01) ¥ 1 
PY =OX =l) = — = =- 
kD 4 
2 
Fey 3-2 
P(Y =1X =1) = ——— = 48 == 
fix) Y% 4 


| 
PY =2|X =1) = f- 


Saas 
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1 1 1 2 
7 = 0-0-0+0-1-—+0-2-—4+1-0-—41-1-— 
(ii) E(XY) 8 8 3 3 


+1-2-442-0-L42.1-442.2-0 
8 8 8 


22 2 6 
= —+—-4+—-=— 
8 8 8 8 
2 4 2 8 
= 0-—+1-—+2-—=—=1 
BUD 8 8 8 8 
2 4 2 
= 0-—+1-—+2-—=1 
es 8 8 8 


cov (X,Y) = E(XY)— E(X) E(Y) 


6 
=A pels. 
golis Y, 

Ex. 11-20. Consider the experiment of tossing of two tetrahedrons. Let X be the number 
on the first and Y the larger of two numbers. Obtain joint discrete density function of X and 
Y. Also find: E(XY), E(X +Y), E(X), E(Y), E(X”), E(¥”), vat(X), var(Y), cov(X,Y) and 
Y¥xy E(Y|X =1), E(X|¥ =2). 

Sol. For joint discrete density function see Ex : 11-5 
4 4 5 


4 4 
= 1-—~+2-—4+3-—+4.— == 
Ee) 16 16 16 16 2 


E(Y ay er eee ely Mees 
“) 16 16 16 16 16 


4 4 4 4 4 30 
2) = 17. 42?.— 4.3? 4.4? -— = — 14449416} = — 
ES) ie 6 es ee ar 
ae ak: ee a ee 
2) = 1?.— 42? .—43?.— 44? 
ae) io ig ie * 
= A fi4124454112}-12 
16 16 
1 1 1 1 v) 1 1 
SNe ee ee 
tak) 16 16 16 16 16 16 16 
ee Cae, OF en er ee 
16 16 16 
= sg llt243+448+648+27412+ 64} =A > 


1 1 1 1 2 1 
= —(1+1+(4+2):-—+0+3)-— +04+4)-— +(2+2)-— +(2 + 3)-— 
E(X +Y) fe JC Mee ae rae maT ( 6 
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1 
+(2+4)—4+(34+: 
( 6 ( 
1 
= 42+ 
16 
var (X) = E(X?)- 
_ 30_25 
Pe 
var (Y) = E(¥)- 


cov (X,Y) = E(XY)- 


ve a var (X 


ee fyy @, 
frixQy/D = fe) 
a fyy O%: 
fyi (412) = fy (2) 


1 
E(Y/X =1) = qilt2 


E(X/Y =2) = +t 


16 
Ex. 11-21. For the bivariate di. 
fx Qe! 
Find : E(X), E(Y), E(X”), Er 
cov(x,Y 
1 
Sol. fy(x) = forty 
0 


I 
fly) = Jary 
0 
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8 8 


2-0 


2trahedrons. Let X be the number 
liscrete density function of X and 


vat(X), var(Y), cov(X, Y) and 


’ 


135 


is ar 


1 2 1 
)i— +(2 +2): +(24+3)-— 
16 : 16 16 
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1 3 1 4 
+(2 + 4)— +(3+3)— + (34+ 4—+(444)— 
: 6 : 6 ‘ 6 ‘ ‘Te 


var (X) = 


var (Y) = 


cov (X,Y) 


Yay 


fyixQy/1) = 


Fyiy(x/2) = 


E(Y/X =) = 


E(X/Y =2) = 


410 4344454+84546418+7432} = 

16 16 

E(X*) -{E(X)}? 

30-25 _5 

A 4 4 

2 
170 (50) 55 

E(¥?)-{E(Y)}’ =-—-| =| == 

1609} 16 (#} 64 
= E(XY)- E(X) E(Y) 

135 5 25_ 10 

16 28 16 

cov(X.Y)  _ 10/16 2 

var(X).var(Y) [5 55 Vil 

4 64 

fyy(y) 

fy @ 

fyy(%y) 

fy) 

1 5 

— 41424344; =— 

at } 2 

ae 2 

—<].-— 4+ 2. — -5 5 

+{ 16 = 5 

16° 


Ex. 11-21. For the bivariate distribution given by 


Fyy Oy) = (x+y) Ton) Loy 


Find : E(X), E(Y), E(X”), E(Y”), E(XY), E(X + Y), var(X), var(Y) 


Sol. f(x) = 


fy) = 


cov(X,Y)andyyy, E{Y/X =x}. 


1 
fort ydy= x45 , O<x<l 
0 ‘ 


1 

I 
Jotyar=y+5 , O<y<il 
0 


(See Chapter 13). 


454 


Similarly, 


also 
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1 1 
1 1 tl 
B(x) = fx fx) =fo{xe3}ura ted = 
0 


0 
BW) = Yo 


1 

1 I ot 

2) = | x71 x42 |dx=—+—= 
Br f(s ;) 4-6 12 


0 
EY?) = hp 


2 5 49 11 
var(X) = E(X?)-{E(x)}" =—--—> = —— 
ue a C8) 12 144 144 
var (Y) agit 
144 


11 
E(XY) = [ fxcty)deay 
00 


E(X +Y) (x+ y).(x + y)dxdy 


i 
oy 
S Se 


= fac foceytay 
0 oO 


, 3 
: face 
3 
0 


1 
= 5 Ja {ura -2)} 


0 


1 


y=0 


NI 


3 4 4 


coy(X,Y) = E(XY)- E(X).E(Y) 
epee SOE 


1 
fh Gat ot ee ee! ae 
» 12 “6 
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Yxy 


frix(y/ x; 


E{Y/X =x] 


Ex. 11-22. Suppose that the ran 
(0,1) i.e., fy (x) = Toy (%). Also th 
with parameters n and x i.e., 


PY =y|X =X 


Find: (i) E(Y) Gi) Distributi 
Sol. As conditional distributio: 
E(Y|X = x 


E(Y 


Joint density function of X anc 


fyy (ay 


. Distribution of Y is 


fy(y 


BIVARIATE DISTRIBUTION, ats 


cov (X.Y) Vag ees 


Yxy = = 
var (X).var(Y) oe 11 


144 144 
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fyy(%y) 


Pei ts x) 


ae " Ton OY) 


0 x+— 
2 


1 
- [eed] O0<x<l. 
ix dy ey 2°33 
Ex. 11-22. Suppose that the random variable X is uniformly distributed over the interval 
= i + i = 1 (0,1) ie, fy (X) = [(93)(x). Also the conditional distribution of Y given X = x is binomial 
6 6 3 with parameters n and x i.e., 
PY = y|X =x) = "cy x7(1-x)"", y= 0,1... n. 
t y)dx dy Find : (i) E(Y) (ii) Distribution of ¥. 
Sol. As conditional distribution of Y is binomial with parameters n; x. 
7 E(Y|X =x) = nx 
dy 
E(Y) = E{E(Y|X = x)} = E(nx) 
i = nE(x) 
yen = nf xdx = a 
0 


Joint density function of X and Y is 
fy y) = £Q) frix(/ 2) 
= "By ay y=0,1,2....... n. 


aT 
¢ 1 igen 
| =p 6-1 thas O<x<1 
-. Distribution of Y is 
.E(Y) 
sit. fy) = |e, x*(-x)"" dx 
144 : 


1 


= Res fara ayrlde, 


0 
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"cy. B(yt+in-y+)) 
i ytin-ytl 


z (ytl+n-y+l - 


mn, FQ+DT.@-y+) 
Se T(n+2) 


ny! (n-y)!_ 1 
~ yl(n-y)! (n+)! n+l n=0,1, 2,........ n. 


Ex. 11-23. For the distribution given by 
Sy (x,y) =e*” 10,0) (%) To, «) (y) 
Find : (i) P(X >1) (ii) PA< X + Y <2) (iii) P(X < Y|X <2y) 


(iv) m. s.t P(X +¥ <m) == (v) P(O<X <1/Y <2) (vi) pyy. 


oO 


= foray oS py? 
Sol. fxs) = fe dy =e {-e } =e 
0 
Similarly, fr(y) =e? 
FxyQny) =e = fx) fr) 
“. X,Y are independent 
Pxy =0 
(i) P(X>1) = J fy(x) dx = fetdx=e" 
1 1 
(V) PO<X<1/Y <2) = P(O<X <1) (as X,Y are independent) 
1 
~ fetae=1-e" 
0 
(ii) To find dist. of X +Y, put 
U = X+Y, V=X 
Then X =V,Y=U-V 
O<V<U and 0SU<So@ 
ou OU 
, {ox ayy f 1 
7 jv av) to 
OX oY 
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.. Joint density function U,V i: 


fuy (u,v) 
Marginal density of U is 


fy @) 


P(<X+Y <2) 


(iv) 


Ni 


: (m+1)e~™ 


(iii) PUX <Y¥RX <2Y) 


P(X <2Y) 
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-y+1) .. Joint density function U,V is 
y+ fuyyv) = e“|J|=e™ 
vel) - Marginal density of U is 
u 
n~-ytl) fy(® = f e “dv = ue“ 
-2) 0 
n-y)! PU<X+Y<2) = PI<U <2) 
+D! ntl n=0,1, 2,000... n. 2 
= fue “du 
1 
(0, 0) (y) is 
<¥|X <2y) = |-we™ + fed 
1 
2) (vi) . 
) (Vi) Pxy = @!~2e? 4¢e! 6? 
oe = 2e! ~3¢7 
{-e? he =e" l 
(iv) rs P(X +Y <m) 
= P(U <m) 
r(y) m 
2 fue 
0 
a -y)" 
= [-ue “_e “| =-(m+De" +1 
*dx =e! ? 
1 
(m+le"" = 2 which gives m. 
{as X,Y are independent) (iii) P(X <YAX <2¥) = P(X <Y) 
0 y 
_ le fer*dy dy 
0 o 
: ferfi-e>} dy 
0 
Seed 
2 2 
=~] 


P(X <2Y) 


il 
ee ) 
is) 

1 

se 
I? 

o f 

e 
jeaoraeen goer 

& 
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M 


kee 
3 3 


1 
P(X <¥|X <2Y) = val 


Ex. 11-24. For the bivariate distribution given by : 
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fey (ey) =e (1 e™) Teg.y) (2) [io,0) (9) +e “Ue ) Tea.xy (Y) Leo,0) (X) 


(i) Show that: fyy (°) isp.df. and find 
(ii) the marginal distributions of X and Y. 
(iii) E{Y|X =x} for 0<x 
(iv) P{X $2;Y¥ <2} 


(v) Pay 


(vi) another joint prob. density function having the same marginals. 


sot. (i) Jf fer») dedy=h +1 
xy : 


¥ 


where I, = Jo fera-e"ax 
0 


0 


= ferayly-1427} 
0 


M1 


0 

= i-14t-2 

2 2 

2 x 
Similarly, I, = fax Jeta-e? dy => 
x=0 y=0 
J J fers») dxdy = dy a4 
, 2 


freay -| e dy + e dy 
0 


ao 


0 


1 
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Also fyy (sy) 20 
«. fy (-) is prob. density functi 


(ii) Marginal distributions are as | 


fy) = fede aes, 


= ye”. 


This is because : region of integra 


region for given y, x varies from y to o 


f(x) = 


I 


(iii) frix(y/ x) 


e?(l-e”) Lox 0) (Y) T(0,00)(X)' 
xe *] 


1 


xe” 


[ePd-e ray) Fe 


(iv) P{X <2;¥ <2} = 


459 
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Also fyy (x,y) 20 
dy .. fxy (-) is prob. density function. 
(ii) Marginal distributions are as below : 


fy) = [iera-e*ar+ ede? bdr 


= ye”. 
This is because : region of integration for second integral is shaded portion and in this 
region for given y, x varies from y to w. 


fy (x) = J e%a-e* ay +Petd-e)ay 


x 


—e*) To.2) (Y) M(0,00) (*) 


xe * 


Fxyy) 


(iii) Frix(y/ x) = fy (x) 


= {e1-e*) Loy) (2) 10,0) 9) 


$e *(1-€) Lo. (9) Looe}. 


he same marginals. peers ree 
xé~* 10,00) (X) 


e(l-e™*) Kx,20)(9) T (0,0) (%) te (le) Lo.) T(0,0)(X) 


xe 1G.) (X) 


[era Seip woltes dee7) Loas(0)| 


i 


Xx 
xe 


. eoxy= | [ytd dye f yer | 
xe * | ¥x 0 


oo) x 
5 =l+—. 
"dy+ fe *dy 2 
° 2 y 2 x 
iv) P{X <2;¥ <2} = fay ferd-e*)de + fax fe“a-e yay 
y=0 x=0 x=0 y=0 
1 2 2 
a, a Ay ay i> —x -xX 
e ays ~ je {y+e I} dy + Je {xe thax 
y= xs 


0 


2 2 
_ {fo —le “dx jem 
0 
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(v) E(X) 


E(X’) 
var (X) 


Similarly — var (Y) 
cov (X,Y) 


cov (X,Y) 


Pxy 


= fx fxyQ= [rete 2!=2 
0 


= [2?-xe“dr = 31-6 
0 


E(X*)-{E(X)}’ =6-4=2 
=2 
= E(XY) 


x=0 y=0 


fren oto tDe” “ila 


y=0 


o 


+] xe* 


0 


és 2 
-x]x nae. 
= af xe (= +(x+ De thas 


1 1 1 
= ranepet a] 
= 64+1-2=5 
= E(XY)- E(X) E(Y) 
= 5-4=1 
cov (X,Y) | 1 


= — 


(s.dof X)(s.dof¥) 2 
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foe) , o x 
J[pvera-e ar ay +f faye"d-e? yavay 
y=0 


2 
= +(x+De%* - thas 


(vi) An infinite family of joint probability density functions, each having the marginals 


fy (x) and fy (y) is 


Fry ysa) = fy 2 fy(n[t+of2Fy (a) 1}{2F, 9) 1] 
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where —1 <a <1 (See Ex: 11.7 


Take a 
.. New joint probability densit 
fy Oy) 


Ex. 11-25. The joint probabilit 

fx y sy) = 3 

Find ; (i) the marginal densit 
(ii) P(X+Y <0.5) 

(iii) E{Y/X =x}, E(X), 


(iv) cov (X,Y) 
Sol. The region of integration i 
(i) For given x, y varies from 


Oto 1-x 
: 1-x 
o fy(x) = 3[ @+y) dy 
2 I-x 
= oe] 
y=0 
= 5-2") 


(ii) P{X+Y <0-5} 
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where —1 <a <1 (See Ex: 11.7). 


Take a =0 
.. New joint probability density function is 


fey (xy) fe feof) 
= (xe™*)(ye™”) 


= xye*, O<xs0 


O<yso., 
Ex. 11-25. The joint probability density function of X and Y is as below : 
Fyy ny) =3(xt yoy (*+ Woy oy) 
Find : (i) the marginal density of X 
(ii) P(X +Y <0.5) 


(iii) E{Y/X =x}, E(X), var(X), E(Y) 


(iv) cov (X,Y) 


De) x 
J Jena ~e”)dx dy 
Sol. The region of integration is shown shaded in Fig. 


x=0 y=0 
(i) For given x, y varies from 


Lay Oto 1-x 


: l-x 
fy) = 3 + y)dy 


2 I-x 


oO 


2 
-x|x ae 
+f xe (= HO Ne thas 


(0) 


fa 


ix + xe * dx — ie rede] 
0 0 


2 
y=0 


=50-2) 


(ii) P{X+Y <0-5} 


0-5 0-5-x 
= J [3cr+ dedy 
x=0 y=0 


actions, each having the marginals 


t{2Fy(y)— 1] 


462 
(ii) frix (1X) = 
E{Y/X =x} = 
E(Y) = E[E{Y|X = x}] = 
(iv) E(XY) = 


E(X) 


E(X*) = 


var(X) = 


_ cov (X,Y) = 


bas 
se ~3x+2)dx =— 
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Fxy ey) __3(x+y) 
Ix) Su-2%) 


* (x+y) 


(4° 


x x+y) 
i ae eee dy 


2 3 l-x 
cae Wee i Ee 
1l-x* 2 3 


0 


ae {x3 -3x+2} 


1 
E 243 
Late (x +0 


lf ol 3 3 3 
share SORE )dx 


5 [a -3r42)ae= 


fs xy(x+ nay} 


1 
10 


I 3 I 3 
[feo ax e 5 [xd-2 r= = 


3f' 2 ee. 
= (-x*)dv=— 
1 9 19 
BOOMER) = Se 
tECOY = 5-35 = 300 


E(XY) — E(X). E(Y) 


13 3273 
10 8 8 320 


Ex. 11-26. The discrete density of X is given by 


fc) = 3.4512 


and fy;x(y/x) is binomial with parameters x and > 
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x 


ees faee |, 
ie, frixQy/ x)= c,(2] ,y=0,1 


Find : (i) E(X), var(X) 
(ii) E(Y) 
(iii) Joint distribution oj 


Sol. (i) E(X) 
E(X?) 

var (X) 

(ii) EYY/X: 
E(Y 


(iii) Fry Qy. 


11.6. Bivariate Normal Distribut 
Let x and y be linearly correla 


same. 
Now Eq. of line of regression 


y=, 


“ 


=> 


Where P = Correlation co-efl 
Which gives the mean of the \ 
Also standard error, of estimat 
This is also the s.d. of y for gi 
. Conditional density f” of 


od 
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+y) 


e+ 2} 


ax+3) 


bx+2) 5 (La? )de 


w= 


«+ yaya 
\de= 4 

10 

3 

x(l-x*)dx == 

) 8 
oi 
5 
-1_9 19 
5 64 320 
V) 
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x 


ly 
ie, frixQy/ x= ‘e,(2] ,y =0,1,2......x. 


Find: (i) E(X), var(X) 
(ii) EY) 
(iii) Joint distribution of X and Y. 


Sol. (i ny Sept eee ee 
ot (x) a> 4 3 83 
1 8 
2) = Dx?-S ata} 
2 al eae We 
var (X) = E(X?)=(E(X)) =3-2.=2 
i E(YIX = ges 
(ii) ( ) 8 


(iti) FyyQoy) = fyrx(y! x): fx) 


: 4). 
_*/(! 


ws 


11.6. Bivariate Normal Distribution 
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Let x and y be linearly correlated normal variates such that variance of y for each x is 


same. 


Now Eq. of line of regression of y on x is 


=L(x-3) 
pon x-x 
ya eS. 


~ 


ee ee 

y+p—(x-x) 
x 

Where P = Correlation co-eff. between *, y. 

Which gives the mean of the values of y for given x. 


> y 


Also standard error. of estimate of y on x is oy y1- 0? 


This is also the s.d. of y for given x. 


. Conditional density f” of y for given x is 


x 


oy 
»-[osesteen} 
Go 
2 


2 


Syyl-p 


a IBS BIVARIATE DISTRIBUTION 


)2 


1 (y-F)- poe (2-3) te 
joo. Put 1p? = u,¥ =1 
1 2 8, 1-p? 
——= —— € 
Oy yi-p? V2n Ou 
u,v) — jOX 
Now the density f” of x is axX,Y) Ou 
1 ze oY 
1 e? 3, 
o,Vv2n 1 
. Prob. density f” is given by 1-{ 
il feegy? [O-D-PO-D] __ Pp 
: “2 = } ‘ oyyl-p? i- 
fay) = —=———— 
V2n 6,. 2no,y1-p” 1c po pa eee 
Total prob. == J fe 


+. (1) Represents a density f”. 
normal distribution. 


11.6.1. Marginal and Conditiona 
Marginal density of x is given 


sali 
= = me; 2 
x-¥){y-y)_ (y-y 

. G, I 5, }-( S, 1 (A) 


Both x and y vary from —c to oo f(x) = [ fe. 
a) 3 
X~-X 
. Total prob. SI fex “(=| 

” One, So ae pce 2p?) ~p’) 2 

; 2710 ,C 
= = a: 
2p rae | ae ae Wy We Ae dx dy 
o, Oy Oy 
xX-X yr-y 
. Put a ae = 
ut = 
: y 
apo lled-t 
Fa Pad ~2pXY+Y » acer 

2n 1-p? 00-00 = 
206 .4 
al jo {(X~p¥)?+(-p?)¥?} i 

= dX dY 

ae Be Se p”) _ 

For def. of correlation see chapter 13. 200 1 
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)” X-pY 
(x-¥) ert. 7 
ra Put 1-p? =u,Y=v 
ju ay 
Au, v) OX ox 
HX, Y) Ou Ov 
oY oY 
1 
: 0 
= tJ a 7) - 2 
oe I. (-P)-PLa-2) __ Pp i l-p 
Oo, oy 1p? 1 = p? 


e “hb é v 


* ., (1) Represents a density f”. The bivariate distribution given by (1) is called Bivariate 
normal distribution. 


11.6.1. Marginal and Conditional Densities 
Marginal density of x is given by 


— 2 : 
y-jy 
-( S, (1) 


felx) = | flsy)dy 


” Qno,0 seater Je co satan (2) 23) 


Put =¥, =X 


(x? -20xr¥?)| car 5 
fe. fox y? ee 
2noxJl=p” ~2(1-p”) 


(X-pY)?+(1-p2)¥?} 
dX dY 


ll 
nN 
iss) 
wits 
% 
8 memeny 8 
2) 
Pas 
‘+ 
| 
Ss) 
~~ 
~— 
_ 
a) 
— 
= 
ao) 
oe 
eee | 
a 
= 
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1 wly? % ty? ma X-X 
= e 2 fe 2 dg. where 9 = i (ue Put X= ; 
206 ,. pid 1-p? Ox 

: => dx = o,dXx, 


tl 


Moo (th, t2) 
any|l- 


o,v20 


Similarly marginal density of y is given by 


AQ) = Leet] 


y 


Then term within { } brack« 


2 2 2 
Conditional density f” of x given y is given by X° -2pXY¥+¥°-20-p°)( 


f(xy) 
f,Y) 


= (X-pY)? -201-p?) ho x6 
Friy(4/y) = 


= {X-pY¥-(1-p’)t,0,}" +1 


= (X-p¥-(-p")40,)' + 


2 
neerh Seer: yin 3} 
V2n.0,.y1-p? 21-p*) | Sx Sy 


= {X-p¥-(1-p?)to,} + 


_ 6 ms 
=> Conditional distribution of x given y is normal with mean * + P mS, ~Y) ands.d. 
y 


2tyl-p 
6,yl- p” 
5 a Oy _ 
Similarly condition distribution of y given x is normal with mean Y + oe —*X) and Put xX 
x 
s.d. o,yl- p*. 
11.6.2. Moment Generating Function Ou 
Ox 
Mo oltyste) = Efe An, v) - 
AX,Y) — [Ov 
OX 


i 


ffersa, y) dx dy 


-D- 
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where 9 = 


d$ 


- 


2 
r-[FepSo- >) 


Oo, 1-p? 


vith mean ¥+9—* So Y) and s.d. 


a, 9 = 
ial with mean Y +P—(a-X) and 
x 
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x-xX ya27y 
Put x= 3, ’ 3, 
> dx = 9,dX,dy=0o,dY 
etthy oe (Xt -exY HY?) 
Moo (ty,t2) = - | [ee eit MER) dX dY 
2nyi-p* pr 25 Se 
elit thy 0. 
= ——— | fos] ts 1” —~2pxXY+y? 
Qnyl-p? ©, a(1- 


~2(1- p2)(t40 ,X + 10 ¥)} dX dY 
Then term within { } bracket is 


X? ~2pXY +¥? -201-p’)(t10,,X +)0,,¥) 
= (X-pY)’ ~2(1-p”) to ,(X - pY) + (1- p’ ){¥? ~ 2pt}0,¥ - 240 ,Y} 
= {X-p¥-(1-p*)t0,}° +(1-p”) ¥° -2pt,0,¥-(-p’)to,” -240,Y} 
= {X-p¥-(1-p’)to,}* +(1-p*) {(Y-prt, 5)” 
~2t0,(Y —ptyo,)-t,' 5," — 2ptt,o,0,} 
= {X-pY¥-(1-p”)t0,)? +(-p’) {¥ ~pt\o, -1)0,)” 
~(t0," + 2ptytro 0, +t,"o,)} 


meer as | 
Ft y+, (170; +2phyto0,0,+130;) 


Mo. (ty, t2) = e 


(X-pY -(1-p*)449,) 
+(1-p*)(Y -prt,o, = toy) 


— JJ 
—= | ] exp 
2nyl—p? -2(1-p”) 
Put X-pY-(1-p)t,0, =uyl—p? 


Y—-ptO,-to, =v 


dX dY 


ou aul 
Ox oY 
Hu, v) 
X,Y) jor ay 


ox oY 
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ll 
oOo 
_ 
— 

| 
uo) 


o Moo (tt) 


Se tell 
exp E +ty+ sliie: +2pt)t,5,0, + #03) 


Cor. (1) if x=y=0, Oo, =, =1 


Lig +2ptyty +13) 
Moot >t2) = e2 


(2) Mg z(t ty) = Efe P*2O-Y)} 
= e WF) My, (t, 6 ) 
sro? +2ptyto,a, +h, oy) 


which gives m.g.f. about (x, y) : 
Ex. 11-27. For the bivariate normal distribution : 


1 
dP = >= & 
2nyl-p? 


where x, y denote the deviations from their means, find m.g.f. about mean (x,y). Deduce 
that . 


of aap tomes] 


Hy =(r+5—Dpp, 3.4 t+ -DG-Dd- p”) H,2,5-2 and hence show that 


L,; = Oif (r +5) is odd. 
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Sol. As in, cor. 1 it can be shown 


L249, 


Mx 5(t, ty) =e? 


let Mz 5(t,12) be denot 


M= 


disap ty +t") 
e2 


. L' 
> log M = sn + 2ptit. 


Differentiate partially 


= at, 


and Oty 


OM 
— = M(t, + pt) 


OM 
—=M(t, + pt) 


Differentiating (3) wrt. t, partial 


aM 7 ieee 
Ot,0t, = Ot, 
= M{t, +t. 
= M {tt,(1+ 
a’M 6M 6M 
* Brat, ee eta 


= M {t,t2(1+p7) + p(t? Hi? Pie 


=M{p+(1-p")tty} 
Now M= 


aM 
Ct, 


EEh 
pete 
EE ms 
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it 


) es —2pxy + a) 
id m.g.f. about mean (x,y). Deduce 


-2,5-2 and hence show that 
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Sol. As in, cor. 1 it can be shown that 


-let 


and 


1 2 2 
(ty +2ptyt, +1") 

=p2 “ 
My s(t,t)=e7 | 


M5 (ty, ty) be denoted by M. 


1 3 2 
7 +2ptt +t." ) 
e2 


M= 
1: , 
log M = sr + 2ptit, fi) ...(1) 


Differentiate partially wrt. t;,t, respectively 


— —=1t,+pt 
M ot, 1 + Ply 

OM 

ea = M(t, + pt) | ...(2) 
OM 

a, Pea) ...(3) 


Differentiating (3) wrt. t, partially. 


ou = Gy (2+ en) + Mp 
= M {t, + pty) (t, +pt,) + p} 
= M {tt,(1+p?)+ pt? +12 +1)} 
Sp ee 
Ot, Oty 


= M{ht,(+p?) +p? +8 +1)-pt, (t, + pty) pty(ty + ph )} 


= M{p+(1-p*)yt)} 


OM i, Ot 
at, Didi STE 


5 


a°M as 
ue = Hs 
Ot Ot, 2 (r-])! (s-D! 


r sO 
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ty 


Substituting . @ 
PDL De es a Gop 


rr. 7 ait ai PLL (r—-1)! s! 
= te ee Fe (ap Lh a 8 ae 


zt 


s-l 
i) 


(r-1)! (s-D! 
—pr-l Heat, s-1 ~ p(s —1) Hy-1,s-1 


Equating Co-efficients of ————| 


= Pip a.e1 tl p7) (7-1) (8-1) Hy, 5-2 


=> yy = (+ S- NPM HP-D(s- DU ~ pH, -2, 5-2 (5) 
we have Hor = Bio =0 


Put r=l1s=2 


Similarly Mo, =0. 
Put ‘r=2,s=3 


bog = (243-1) poy, +1-2.(1-p?) Hoy 
=0. 
Similarly 4) =0. 
Also we have 1,,, =[o, =0 if 7,5 aré odd. 
Now if R + s is-odd, so are 
(r—-I)+(s-1) = r+s-2 
and = (r—2)+(s—2) = p+s—4 


and so on 
.. We have 


H,, = Oif (F +5) is odd. 
Theorem 11.6.3. If (x,y) have a bivariate normal distribution then x and y are 


independent iff x and y are uncorrelated. 
Proof. If x and y are independent then 


Cor (x,y) = 0Oie., x and y are uncorrelated. 
Converse. Let x and y be uncorrelated 
>p =0. 


. : a 
1 lL |{x-x y-y 
—— - + 
f(y) = 216 ,Oy a 2 { o ( Go ] 
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I 


o,v2n 
f-f, 


=> xand y ; 


Remark : If (x,y) don’t have < 


that x,y are independent whenever | 


and y are normal (but their joint dist. 
Ex. 11-28. For the bivariate non 


_ aP = const exp 


show that correlation coefficient bet: 
Sol. Here 


EQ 
Bere 
Similarly 
Now joint m.g.f. of x,, x, is 


1 9 mf 
=(ty +t +2 
gti ++ 


Moo (ty, t2) 


1+5(t 4 


E(x!) _ 4’ (0) of 
4! 4! 


tl 
Q 
o 
o 
m3 
° 
g. 


. E(x;")=3 
Similarly E(x,*) =3 


Also E12) _ 1420.0) 
212! 212! 


= Co-eff of 
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: exp ie | es ieee exp oe) oe 
~ 6, V2n 2|\ oy o,V2n 2\ oy 
— f,).f,0). 


> x and y are independent. 


t5 tt 
PL Dab rl (s—D! 
Ss 


Remark : If (x,y) don’t have a bivariate normal distribution then it is not necessary 


that x.) are independent whenever p = 0. This is so even when marginal distributions of x 
and y are normal (but their joint dist. is not bivariate normal.) 


“D Br-ts-1 Ex. 11-28. For the bivariate normal distribution 
i(s-1) Hy-2, 5-2 2_9 + 2 
eves: {ston | 
1)(s-DU-p*)H,-2, 5-2 wif) ~2(1—p*) 
show that correlation coefficient between x,* and x,” is p” 
Sol. Here 
x, = 0, var (x,) =1 
E(x?)-x? =1 
=> : E(x?) =1 
2 
P”) Ho, Similarly E(x,’)=1 


Now joint m.g.f. of x,, x, is 


slit +2pH 09) 
Moo (4, ty) =e 


2 
ae Pe ee hl ie - 
= 1+ (ti +t; +2pt, natal +5 +2phty } teeseens 


E(x}') _ U4 (0) of x 
4! 4! 


ormal distribution then x and y are | 


lated. 4 
E(x, ") = 


” E(x,*)=3 


ll 
vs) 


Similarly E( x9") 


Also EQ ¥2) _ Hi2(0,0) 
2!2! 212! 


= Co-eff of i e 
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a eae 
Juli +-—- 
ale 5) 


E(x,7x.") = 2p? +1 


- COV ine x3) 
(s.d. of xy) (s.d. of K) 


Y x23 


{EGt) (EGP)? }  {B3)— (£02) } 


Ip 41-1 . 


JG-DG-D~ 


Ex. 11-29. Let X and Y have bivariate normal distribution with parameters x = 5,6, =1,.- 


y =10 and o, =5. 
(i) if p>O, find P such that 
P(4<y<16]x=5) =0.954 


(ii) if p=9, find P(x+y <16). 
Sol. 


: 2 2 
1 -1 x-5 #3 )(25") =") 
— poe ieee -2 —— |4+] —— 
FryQy) a er oo] to i of 1 5 5. 


, ieee 4(22) 
an . f(x J2n p 2 1 
feysy) 
fyixy/x) = Fh) 


: 1 eels ie =2_o(==)l 
fn 5yft—p? *loa=p%) | 5 1 


1 1 y-10 
2Q|x=5) = ——————, exp +53") 


E(y/ x= 5) = 10 
; = 
and = var(y/x=5) = (s l-p } 


y—-10 
5 l=p? 


Put . z= 
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Pi4<y<16|x=5} 


6 
* PSO <7 < mex = 5 
5¥l-p? 


.. From Normal Tables. 


6 

5y1-p? 

| 1-p? 

(ii) If p=0, Xand Y are inder 
xisaN(5, 1) 
yisaN(10, 5) 

. t=xty isaN(15, 26) 

P(x+y $16) 

Put Uu 


‘11.7. Bivariate Transformation 


Let x and y be two random varii 

Let x and y be transformed to variate 
x: 

where u, v are continuously different 


Let J 


Then the joint probability densit 
) z lJ 
Ex. 11-30. If the probability den 


f(xy) 
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E(x3) 
xf) (E(3))"} 


ation with parameters x =5,0, =1, - 


954 


11.7. Bivariate Transformation 


Let J= 
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P{4<y<16|x=5} = peed 


6 0.954 
" fy eae ee a = > =0 
.. From Normal Tables. 
a sO. 
SJi-p2 ~2 


1-p? = 0-365 p=0-8 


(ii) If p=0, X and Y are independent 


xisaN(5, 1) 
yisaN(10, 5) 


“ t=xty isaN(15, /6) 


P(xt+y<16) = P(t<16) 


~15 


t 
Put = 


Fb. 


= Pu< 


26 


0-5793 (using normal tables). 


= 0-196) 
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Let x and y be two random variates having joint probability density function f(x,y). 


X= x(u,v); y = y(u,v) 


Ox 
Ou - 


x 


Ou 
Then the joint probability density function of u and v is 
|J| SF {x(u,v), yatv)}. 


Ex. 11-30. /f the probability density function of two variates x and y is given by 


ey= | 


where u, v are continuously differentiable function. 


Ox 
Ov 


oy 
Ov 


+#0.° 


~ Let x and y be transformed to variates u and v by the transformation 


Axy et). 29, y20 
0. 


elsewhere. 
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Then v20, O< v<S2u 


Find the density function of ee py 


Sol. Put u= yxoty’, v=x 


Then v20, u2v. 1 
u2=0,-O0<v<u 7 - 
‘ 1 u,v) 
Ww —_— = 
: J &x,y) 
au ou : 
Ox oy 
= .. The prob. density function of u 
& 
Ox oy 


.. Density function of u is 


I 
tay 
N 
-?}- 
< 
nN 
cad 
nN 
-F/ 
ed 
nN 
It 
cad 
ale 
+ I< 
see 
Nn 


.. The probability density function of u and v is given by 


FuyUsv) = f(x, y)|J| Ex. 11-32. The joint density funct 
f(x, y) = 
eae 
(Arye) V8 a ,x20,y20 = 
y Find the distribution of x + y. 
0 elsewhere 
: Vv 
_ J4uve™ u20,0<vsu. 
0 elsewhere 


.. Marginal density function of u is 


u 
ccopae ~y2 
4 Juve  dv=2we™ ,u20 
v=0 
.. Density function of u is 


Wwe" u20 
0, - elsewhere 


Ex. 11-31. The joint density function of x and y is 
fy y) = e&O”, x>0,y>0. 


x+ Sol. Put u=x+y,v= 
Find the prob. density function of y gee ee a 
2 X=U-V,y=V 
: i >0>v>0 
Sol. Put “u= xem Yivex. y, 
‘ 2 O0<x<l1>0<u-v<l 
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Then v20, O<v<2u 


On ot 
Ml 
2 
2/2 2|2 


OoOnNl|- 
ll 
| 
| 


.. The prob. density function of u and v is 
AER 6 


- Qe" 
.. Density function of u is 


2u 
2 J eo" dy 
0 


= 4ue™™". 
Ex. 11-32. The joint density function of x and y is 


f(x,y) = 2xe%,0<x<lLy>0 
= 0 elsewhere, 


Find the distribution of. xt y. 


u20,0<v<Su. 
elsewhere 


re 


> 0. 
Sol. Put w= x+y,v=y 
X=u-V,y=yvV 
y>O0>v>0 


0<x<1>0<u-v<il 
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*. uw and v vary in shaded portion. 


ou ou 
Ox oy 
1 
Now Foie 
Ox oy 
1 1 
bf 
J=1 


.. The joint density function of w and v is given by 


F(u,v) = 2u—vye™" 
To find the density function of u divide the shaded portion in two parts marked I and II 
by the line x = 1. : 


In part I, 0 <4 <] and for given u, v varies from 0 to u and in part H, «21 and for: 


given &,V varies from ,-—-| to u. 
. Density function of uv is given by 


f,(w) = fru -vye"'dv 
0 


= 2 [-(u -vje’+ aA 


= 2(e“+u-l) for O<u<l 


and for u21, 


u 
eu) = 2 Jo -vye "dy 
u-t 
= 2{-w -vje + a 
: u-l 
= Qe“ eH Dy_ gD) 
= 2e" 
Ex. 11-33. /fx, y are independent standard normal variates, show that ¥ Jollow Cauchy 


distribution. 
Sol. Joint dist. of x and y is 


J wate Wek, ears 
P= Van | | Vom 
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7 x 
Put ; 
Then both u, v vary from —% t 
x= Uy, yoy. 


.. Joint dist. of 4, v is 


- dP 


.. Dist. of u is given by 


dP 


Ex. 11-34. If (x, y) have a bjm 
variates with correlation co-efficier 
Sol. Joint dist. of (x, y) is 


dp 


Put ; ie 
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8 x 
Put — =u, y=v 
y ; 
Then both u, v vary from —2 to +00, 
x= uy, y=. 
ox 
Ou Ov 
J= 
ay yy 
Ou ov 
4 _|v Wy, 
; ~ 10 1). 
portion in two parts marked I and II , Joint dist. of uv is. 
1 ‘ 2 
i : ~=v"(l+u?) 
0 to w and in part II, w>1 and for dP = x ar ‘iol 
7 


.. Dist. of u is given by 


Loa? 
—sv (l+u") 
dP = a fe 2 |v| dv 


~ 2n 
P 00 
Pa du ~tv?(+u) 
ais “fe z v dv. 
nt 
: 0 
or O<usl 
1, 2 
-rv (l+u*) 
du j.e 2 
= | Cea) 
; 0 
-y]# 1 du 
e =— 7, 
ie us l+u? 


Ex. 11-34. /f (x, y) have a hjvariate normal distribution s.t. x, y are standard normal 


~(u-1) 
-e } 
variates with correlation co-efficient p between them, show that x/y follow Cauchy dist. 


Sol. Joint dist. of (x, y) is 


. 
triates, show that — follow Cauchy ~~ (x*~2pry+y?) 
y u (1-9) dea 
_  2nVl-p? : 
x 
Put ; eas ae aaa 
y 
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Ox Ox 

Ou Ov 

J — 

oy ey 

Ou ov 

v u 

= =v. 
0 1] 


.. Joint dist. of u, v is 


Il -— {u?-2pu+1) 
dP = gre |v|dudv 
2nyl—- p° 
.. Dist. of u is 
du * ers {u?-2pu+1} 
| ot el fe _|v|dv 
2nyl - p? oe 
du f saw apuel 
== Je vdy 
tyl-p? 9 
v? 2 
Put 5, (ue —2pu+lj=z 
2(1-p*) 
, 
1-02 
nas Nie ee 
: u“—2p utl 
yi-p? 
ap = —+—* fetde 
Tu" —2pu +1) 
3 V 1- p? 
=r 3. le ta ee 
m{(u+p)’ +(1-p”)} 
which is Cauchy distribution. 
EXERCISES 
1. Calculate the missing entries in the following bivariate distribution table : 
yo 0 1 2 Total 
xt 
| 0 — —_ 2 
2 — 3 — “4 
3 - 0 — ‘1 — 
Total ‘| — 2 
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2. The joint density function of a b 
f(x, y) 7 


(i) Find marginal and conditio1 
(ii) Are x and y independent ? \ 
3. Obtain the marginal and conditio 


f(x y)=e( 
: =0 
4, Ifthe joint density function of a 
SQ, y)=c. 
=0 
Find (i) P(x > y) 
(ii) Pixt+y<D 
5. Given the following bivariate pr 
x> 2 
yt 
0 -_ 
1 
1 as 
; 1 
2 = 
| 


Find (i) marginal distributions : 
(ii) the conditional distribi 
6. Random variables x, y have the. 


f(x, y)=— 
2n 


Find the marginal densities of x, 
7. Two stochastic variables x and ) 


as follows 
yo 
xt 
1 C 
2 ¢ 
3 C 


Find E(x), E(xy) and E(x + y). 
8. Given the following freq distrib 
the correlation co-efficient : 
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2 
v 


z {u?-2pu+l} : 
inp) |v|du dv 


2 
v 


: — {u?-2pu+]} 
2-p") _|v[dv 


2 
ve 


(I-p”) 


{u? —2pu+l} 


2putl=z 


e* dz 


ot 8 


-p”)} 


iriate distribution table : 


Total 


2 
4 
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2. The joint density function of a bivariate distribution is given by 


_jexy, O<y<x<l 
fry d= 0 otherwise 


(i) Find marginal and conditional distributions. 
(ii) Are x and y independent ? Why ? 


f(x, y)=c(2-x-y), O<y<x<l 


=0 _ elsewhere. 
4. If the joint density function of a bivariate distribution is given by 
fa yy=e.eO™, x20,y20. 
=0 elsewhere 
Find (i) P(x > y) 
(ii) P(x+y <1) 
5. Given the following bivariate probability distribution. 
x—-> -l 0 1 
yt 
; ¥ 1 2 
15 IS 15 
3 
1 —_ — — 
15 15 15 
: 2 2 1 
15 15 15 


Find (i) marginal distributions of x and y 


(ii) the conditional distribution of x given y =1. 


6. Random variables x, y have the joint probability density 


1 x? —2kxy + y* 

f(x, y) = —=== exp {tee 
anvi-k? 2(1-k?) 
—O<X,y< a, 


Find the marginal densities of x, y and also their co-efficient of correlation. 


Find E(x), E(xy) and E(x + y). 


the correlation co-efficient : 
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. Obtain the marginal and conditional probability functions for the following distribution 


7. Two stochastic variables x and y take the values 1, 2, 3 only and their probabilities are 
as follows 
y> 1 2 3 
xt 
1 0-1 0-1 0-1 
2 0-1 0-2 0-1 
3 0-1 0-1 0-1 


. Given the following freq distribution, find the mean values, variances, covariance and 
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a ata el Dt a eed 
> <i 1 Total 
yt 
1 20. 80 100 
I 80 20 100 
Total 100 - 100 — 200 


10. 


11. 


12. 


13. 


14, 


Now let x and ybe independent M(0, 1) variates. Show that u = ~ has the distribution 


ly| 
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State, with reasons but without making any calculations, what shall be the corresponding 


results if the two frequencies in each row are interchanged. 
Let the joint density f” of x and y be given by 
_ J8xy ; O<xey<l 
fy)= {o , elsewhere. 
Find E(y/ x), E(xy/ x), var(y/ x). 
foots 2 l+xt+x? 2 xU+x+x7) 1+2x-6x7 +2x3 +4 
3 lex 730 dex 18(1 +x)? 


For the bivariate distribution 
fix y) = (6-2-9) O0<x<2;2<y<4. 


Find E(y/x),  E(y?/x), var (y/x), E(xy/2). 
Also show that 

E(y) = ELE(y/x)}. 
The joint p.df of x and y is given by 


fdas O<x<y <1 
f(x, y) = ‘ , otherwise. 


: 2 
Show that the conditional variance of x given y is a, 


*1,*2 are independent random variables having the same Cauchy distribution 


,-O<X<H, 


f(x) = 


n(1+x7) 


Show that y =X, +x, has a Cauchy distribution. 


Let.x and y be independent variates which are uniformly distributed over the unit interval 


(0, 1), find the distribution f” and p.d.f of z= x+ -y.As z a uniformly distributed 


variable ? 


Ifz is M(O, 1), show that the p.d.f (i.e., prob. density f” ) of |z| is 


1 
ex: ailed 
f(2)= (2) 1x20. 
TT 


specified by 
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15. 


16. 


17. 


18. 


19. 


20. 


g(u) =- 


Suppose that x, and x, are inc 
normal distribution, then : 
(i) Find the joint distribution of 


*y 


(ii) Show that 2x,x, and x,” - 


x and y are independent random: 
has the chi-square distribution w 


x 
Find the prob. density of dyin 
Hence, deduce that if x. 2 are in 


then Z has a Cauchy distributic 


a 


Let 


F(x): 


be p.df. of the random variable 
density f" of y=x’. 


Ifx and y are two independent ri 


and BI 


Find the probability distribution 


If the joint density function of t 


density function of w= x+y is 


If x and y are independent Bi 


respectively, 
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Total 
100 
100 | 
200 


ions, what shall be the corresponding 
‘changed. 


x<y<l 
where. 


5x7 + 2x3 +x4 
(L+x)* 


22<y<4. 


e same Cauchy distribution 


co. 


ily distributed over the unit interval 


+ y.As z a uniformly distributed 


" f") of || is 


r 
w that w= bl has the distribution 
; ¥ : 
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p 
B(u) =~, [0 <u <a, 
n(1+u7 ) 


15. Suppose that x, and x, are independent random variables each having a standard . 


16. 


17. 


18 


19, 


20. 


normal distribution, then : 
(i) Find the joint distribution of 


Xp +x Xy-X 
t= and 2 1 
V2 V2 
(ii) Show that 2x,x,. and a9 - ae have the same distribution. 


x and y are independent random variables. x has the standard normal distribution andy 
has the chi-square distribution with n degrees of freedom 


Find the prob. density of .ar 


Hence, deduce that if x, 2 are independent and have the standard nermal distribution, 


then = has a Cauchy distribution. 


” 
ra 


Let 


1 
f(x) = Pm es 
0, elsewhere 


be p.d.f. of the random variable x. Find the distribution function and the probability 
density f" of y=x?. 


. Ifx and y are two independent random variables with density functions 


f(x)=e™", x20 
and ey) =2e°", y20. 


, x 
Find the probability distribution of “= 5 


{Ans : | 
; (u+ 2)? 


If the joint density function of two random variates x and y is f(x,y), show that the 
density function of u=x+ty is 


fiw. u—v)dy 


oO 


l 
If x and y are independent Binomial variates with parameters 3; 3 and 2; — 


respectively, 
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—_ 


21. 


22. 


23. 
24. 


25. 


26. 
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find P(x =-y) 


(Ans. 10/2°) 


If X and Y are two random variables and E{y|X = x} =[, where } does not depend 
upon X, show that 


var (Y) = E{var(Y¥ /X)} 
Define moment generating function of y / x = x, Does M,(t) = E{My,x(t)} . 


Ifx and y be two independent random variates, does E{Y / X = x} depend upon x. 


If the joint moment generating function of x,y is given by 


My. y(tysta) = exp {(1? +13)/21, 

find the distribution of x. 
Let X, Y be random variables with joint probability density function fxy (xy). Let 
U(X) and V(Y) be functions of X,Y respectively. Then show that 

E{U(X)V(Y)|X = x} = U(®E{V(Y)|X = x}. 
The trinomial distribution (multinomial with k+1=3) of two random variables Y 
and Y is given by 
n} 


PODS Tah 


pq’ i=p-q 7 

for x, y=0,1,........ n and x+y<n where 0< p,0<q and 0< p+q<\. Find: 
(i) marginal distribution of Y 
(ii) the conditional distribution of X given Y and obtain its expected value. 

(iii) Pxy- 


(See section : 10-8) 
O00 


Special Con 


12.1. Uniform Distribution 
It is a continuous dist. given by 
dF = - 
d 


a, b are called the parameters c 
Distribution f” is 


Fix) =~ 


w’2 (0) = 


Ho = 


12.1.2. M.G.F. 
M(t) = 


MATHEMATICAL STATISTICS 


Special Continuous Distributions 


(Ans. 10/2°) 


=x}= p, where does not depend 


X)} 


_ Does My(t) = E{My)x(0)} 12.1. Uniform Distribution 


‘oes FE. fy IX< . pee eer It is a continuous dist. given by 


is given by 


3) /2\, 


dF = 


dx, asx<b 
b-a 


a, b are called the parameters of the dist. We have a < b. 
Distribution f” is 


1 x 
ility density function fy y(x,y). Let Fx) = 377 J dx 
sly. Then show that : 
: x-a 
V(Y)|X = x}. ~ bea 
+1=3) of two random variables X 12.1.1. Mean and Variance 
¥ = E(x) 

b 
pa sa 
2,0<q and 0<p+q<l. Find: b?-a* beta 

| ~ %b-a) 2 
1 obtain its expected value. 1°28 
W'2(0) = = J Pax 
(See section : 10-8) 3 ; 
goa eee 
3(b-a) 
7 b? +a? +ab 
3 


Me = p’2(0)- x? 
b? +a2+ab a+b 7 
= 3 -(#*) 
_ (b-ay 
12 
12.1.2. M.G.F. 
M(t) = E(e) 
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bh. 
1 
fag 
= e- dx 
b-a J 
a 
el _ el 
‘  t(b-a) 


12.2. Gamma Distribution , 
Let x be a continuous random variate with probability density function defined as below : 


fe i: 
fx) = Ta” ee amare Yea 0<x<e 


x is called Gamma variate with parameter A and is eteted to as a (A) variate. The 


distribution of x is called a Gamma distribution. 


12.2.1. Moments about mean and 8, ¥ co-efficients 


w’.(0) = EQ’) 
if ri*« A-1 
z —— f a 
(A) , xe xX 
_ ted (A+r)-1 yy 
ra 4 
1 
= Fay Pen 


= (At+r—-1)(A+r—2)... (A) 
mean = ’)(0)=A 
W20)=A(A+I) 
11’3(0) =A (A+ 1) (A+ 2) 
WO) =A (AF 1) (A #2) (A +3) 
M2 = W'2(0)— {u's (0)}? 
= MAt+1)-A? 
- eh 
Ms = W’s(0)+3 py’ (0) p's (0) + 2{u', (0)}° 
= MA+1)(A+2)- 307A + 1) 4+ 207 
= 434302 420-323-302 4203 
= 242 
Ha = ps (0)—44’3(0) y's (0) + 6y’2(0) {u's (0)}? — 3 {u's (0)}* 
= MA+1)(A+2)(A+3)- 4070+ 1) (A+2) +607 (A+ 1)-324 
= MA + 602+ 110 +6) — 407(02 +3042) + 604+ 603-3204 


= 3N7+6A 

_ mp4 4 
Bi = ue eo. 

_ Hy 6 

= >=3+- 
Bo 3 x 
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plicated TE 


% = Bi =- 


% = B-3= 
12.2.2. Mode 
Density function is 
fa) = = 
LO F 
f(x) = 0 
for x = = 0, f(x) = 0 which is mini: 
forx = A- 
Mode = d- 
12.2.3. M.G.F. of Gamma Distribu 
M(t) = Et 
a 
“TI 
ce 
> TT 
Mj (t') = Ef 
= Pa 
= e 


12.2.4. Cumulative Generating Fu: 


Ko(t) = log 
= —) 
= x: 
a k,(0) = A, 
In general, 
te = Co 
r! 
4A 
r 
=> k, = ( 
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y density function defined as below : 


0,0 <x <0 


referred to as a (A) variate. The 


\ +3) 


2{u’; (0)}? 
1) +20? 
-223 


h’2(0) {11's (0)}? - 3 {u's (0)}" 
(A +1)(A+2) +60 (A+ 1)-30' 
°(7 + 3042) +604 + 623-304 
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2 
= B = 
Ni vBi i, 
6 
MS peas x 
12.2.2. Mode 
Density function is 
1 
= x JA~1 
SR) T(a) e*.x 
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, 1 —x \A-2 
PO = Faye PG-D-3 


S'(x®) =0 > x=A-1,0 


for x = 0, f(x) = 0 which is minimum value of f(x). 


forx = A-1, f(x) is maximum 


12.2.3. M.G.F. of Gamma Distribution 


1 1 
= Ty) = 
T(a) (1-z)* Be (1 
M;(t') = Efe™™4 
= &*' M(t) 
ae tidal ¢ Oe) ok lt1<1. 


Mode = A-1. 
Mot) = Efe*}. 
= re ee -! dy 
= —x(I-t) y A- l oy 
Fay fe 


Sale! 
= FO) op le 


12.2.4. Cumulative Generating Function and Cumulants 


In general, 


Ko(t) = log Mo (2) 


= -Alog (1-9, t|<1 
=X fade a 
2 3 


k,(0) = A, ky = etc. 


— = co-eff. off 
r! 
_ i 
: 
k, = A(r—D)! 


ee lady, 


where y=(1-12)x, |t{}<1 


pre 
-t)* 


486 MATHEMATICAL STATISTICS 


12.2.5. Additive Property of Gamma Variates 

Theorem. The sum of any finite number of independent Gamma variates is a Gamma 
variate. 

Proof. Let x; x2 .... x, be independent Gamma variates with parameters Ay Ag ve An 
respectively. 


Then Mo (t) ofx; = (L-1* (i=1,2...7) 
Let X = xp t XQ $M 
Then M(t) of X = E {el Ft iy 4 


= Efe"). Efe*2}....E{e"™} 
= (-)™!. -a72....a-97"" 


| 
om 
= 
~ 
—. 
mk 
Rogar 
+ 
y 
>» 
3 
V 


which is the m.g.f. of a (Ay + .... + An) 
X isay (Ay +... + An): 
12.2.6. Limiting Form of Gamma Distribution 
Theorem. Show that as 1 — o, Gamma Distribution tends to normal distribution. 
Proof. Let x be a y(A). Then X =A, var (x) =A, 
x-A’ 


L : = 
et Z Th 


I 
ty 
% 


Mo (t) of z = 


7) 
—~= |x 
= eh Eg i x ; 


me os . x 


a eRe 1? r 
= —-wvAt+ +—+——= 
Va 20 rer 


1 . P : 
5 ? + terms containing A in the denominator 


log {Mo (t) of z} 


1 
as as A 00 


L2 
Mp (t) ofz > au ash — 0 
which is the m.g.f. of a standard normal variate. 
*, In the limiting case, z and hence x is a normal variate. 
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12.3. Beta Distribution of First Ki 
Let x be a continuous random vari 


f(x) = Bt 


x is, known as a Beta variate of firs 
B, (m, 7) variate. 
The distribution of x is called Be 


12.3. I; Moments and Harmonic M 


uw’, (0) = EC 

~ BK 

~ Bl 

oe 

age 

rT 

ei 

7 | 

Mean = wl’ 

W'2 (0) = re 

Ho = We 

zc 

The harmonic mean H is given | 

eek 
H 

“2B 

“8 

f 
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12.3. Beta Distribution of First Kind 
Let x be a continuous random variate with probability density function defined as below : 


f(x) = aah 


x is known as a Beta variate of first kind with parameters m and n. It is referred to as 
By (m, n) variate. 
The distribution of x is called Beta distribution of first kind. 


pendent Gamma variates is a Gamma 


x t n-1 
: 1-x > <x< 
variates with parameters Aj, Az ... An ( yo ym,n>0,0<x<1 


(§=1,2...n) 


12.3.1. Moments and Harmonic Mean 


w’,(0) = E(x’) 


= aa (1—x)"" 1 de 


= ronE. erly! de 
B(m+r,n) 

B(m,n) 

T(m+r)U (m+n) 
T(m+n+r)T(m) 
(m+r-1)(m+r-—2)....(m) 
(m+n+r-}).....(m+n) 


bution tends to normal distribution. 


’ 


Mean = p’; (0)= 

m+n 
; : m(m+1) 
a a (m+n)(m+n+]) 


Me = wr (0) {1'1 (0)}? 
—_m(m+t) fm q 
7 (m+n)(m+n+}) m+n 


mn 


ia (m+n)? (m+n+l1) 
The harmonic mean H is given by 


a) 


Te n) 


tr P 
oe no Se 
2A 3AWA 


itaining A in the denominator 


1 ‘ 
pate xml (l1-x)""! dx 
x 


x 2q-x)"! dx 


aac om 8 


a n) 4 
as B(m—-1,n) 
B(m,n) 


nal variate. 
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m+n-1 


m-1 

m-1 
A=: 
m+n-1 
12.3.2. Mode of Beta-distribution of first kind 


We have 


f(x) Bumn) 
Ifm<1: 
f(x) asx > 0° 
.. x =0 is the modal value. 
Ifn<1: 
Ix) > 2 asx > 1° 
*, x =1is the mode. 
Ifm<1;n<1: 
both x = 0, 1 are modes 
Ifm=1, n=1: 
1 
f(x) B(m,n) 1 
.. Each value of x is modal value. 
Ifm>1, n=1: 
m-] 
RY = BGm,1) 


Here f(x) increases as x increases. 
fis maximum at x = 1 
x=1 is the mode 
fm=1,n>1: 


_ (i-x)""! 
f= BCA) 


F(x) decreases as x — 17 
f is maximum at x = 0 
Ret x =0 is the mode. 
fm> i, n>1: 


I 
f(x) = BGAn) 


' _ 1 
f°) = B(m,n) 


= 21a? (m1) (1-2) - (0 8} 


B (m,n) 
Mode is given by f” (x) = 0 
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T(m-))T (m+n) 
T(m)T (m+n-1) 


xt! a -xy"")) 


xt! (i=xy 


{(m—1) x"? (i-x)"7! -(n-1) "7! (1-x)""?} 
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Se 


=> (m-1)(1-x)-(-1)x: 


= m- 


m+n 
(Obviously this value lies in (0, | 
Ex. 12-1. [fx has a By, (m, n) di 
Sol. Let x be a Beta variate of fi 


Then E(x) = 


m+n 

=> EQ‘) = 1 
which is not asn >0 

: E(\i/x) # 1 


12.4. Beta Distribution of Second ] 
Letx bea continuous random vari 


FQ) = BF 


x is known as Beta variate of secon 
B> (m, n) variate. The distribution of 
Remarks : If x is a Bz (m, n) vai 


] 
yr re 

is a By (m, 7) variate. 
12.4.1. Moments and H2rmonic M 
W’-(0) = E 


Mean = 1’; (0) 


pw’, (0) = — 
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LLL LLL ELL I, 
} => (m-1)0-x)-(n-1)x=0 
) m-1 
m+n—-2 


(Obviously this value lies in (0, 1)) 
Ex. 12-1. [fx has a By, (m, n) distribution, can E(1/x) be unity ? 
Sol. Let x be a Beta variate of first kind with parameters m and n. 


Then E(x) = 


mt+n-l 
= E(x) = 1 iff n=0 
ae O0<x<1 m,n>0 which is not as n > 0 
E(\/x) #1 


12.4. Beta Distribution of Second Kind 
Let x be a continuous random variate with probability density function defined as below : 


| xm! 


LO) = Bonn) (aay 


x is known as Beta variate of second kind with parameters, m and n. It is referred to as 
B2 (m, n) variate. The distribution of x is called Beta distribution of second kind. 
Remarks : If x is a B2 (m, n) variate, then 


1 


y T4x 


m,n>0,0<x< co 


is a B, (m, n) variate. 
12.4.1. Moments and H2rmonic Mean 
w’-(0) = E(x’) 
1 


Rev) 
ant 
3 
= 
~ 


ao) 
os 
3 
= 
— 


i 
Sn 8 Ot 8 Ot 8 
= 


B(m,n) 4 eS) ates Gee) sad 


T(m+r)P(n-r) 
P(m)T(n) 
(m+r—-1)(m+r-—2)....(m) 


(n-1)(n—-2)....(n-r) 


yr! 
2 xy"! (nt) (1- 2)" 


Mean = pw’; (0) 


Il 


y"-? (m= 1) (1-2) -(n—1)3} a 
: m(m+1) 


HW’, (0) = (n—1)(n—2) 
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He = W’ (0)- {11 OY? =a 
2 
i Put ox = 1 
(n-1)(n-2)  \n-1 ; 
_ m(m+n-}) = ee ¢ 
(n—1)? (n-2) oe, 
The harmonic mean H is given by  ¢ 
oe : 
H x 
pees ey ae oc pat 
B(m,n) . x x(l+x)"*" ia 
1 2 (m1) H2 (0) = 
Bimn | (ex OD a var (x) =} 
1 = 
= Ba ; 
M(t) =] 
T(m-1)T(n+1) 
~ — T(m)T(n) on 
_ 0 
~ m-1 
H = m—1 = ( 
n 
12.4.2. Mode of Beta dist. of second kind _ 
1 xm! 
= .————., m,n>0, 0<x<0 _ 
f(x) B(m,n) dea > mn x = 
f’ 1 (m-1)x™~? (14x)"*" —(m+n)x™"! Gexrrt eee ae er X, be indepen 
x) = Qe ee ee respectively, then 
B(m,n) dagen 


x=) 
Put f’(x) = 0 


=p (m-1) "7 (14 "(nt nx ty"! =0 


=> x” (1+ x)"*"—! ((m-1) (1 +x) -(m +n) x} =0 


has exponential distribution with p 


Proof. We have 


Neglecting x = 0, P(x sX) = 
(m—1)(1+x)-(m+n)x=0 ~ 
m-1 7 
> x= : = 
n+l 
This exists only when m > 1. 
> = 
12.5. Exponential Distribution Now yea 


It is a continuous distribution given by 
dF = ae™,x>0,_ 
where o is parameter and a > 0, x is called exponential variate with parameter a, we have 
w’-(0) = E(x’) 
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m-\ 


txymen 
m—-1)-1 


m-l+(n4+h) dx 


n+1) 


), 0<x<00 


n+n —(m+n)x"7! daa"! 
(eae? 


+x)"*"-! =0 


-(mt+n)x}=0 


l variate with parameter o, we have } 
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oO 
= a. f x e™ dx 
0 
Put ax =t ; 
dt 
=> dx = — 
a 
1 ioe] 
=—fteta 
al 0 
r! 
= a 
Se, 1 
x= p1(0) = — 
a 
Fe 2 
2 (0) = 7 
a 
var (x) = 1’ (0)- x? 
Se ee 
az a2 a 
M(t) = E(e™) 
foe} 
= a. f ee dx 
0 
ioe] 
= a f ef OF chy 
0 


Qa = oO 
—_ | elt | 
t-o 


a. 
= —., ift<a. 
at 


12.5.2. If x1, xo, ..... Xn be independent exponential variates with parameters 0), Ol, .... On 
respectively, then 
x = min (xX, X2, .... Xn) 


“ n 
has exponential distribution with parameter do. 
i=l 


Proof. We have 
PQxsX) = 1-P(x>X) 
= 1-—P {min (4, x2, ... Xn) > X} 
= 1-P {(x%; > XVvi 
= 1-P(x, >X). P(x. >X).... P Xn >X) 


(.” x;’s are independent) .A1) 
Now P(x;>X) = 1-P OSX) 
x 
= [- J QO; e Tix dx; 
0 


= got 
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-0, X eo t2X —an,X 


(1) => PSX = 1-e 
= [eux 
.. Cumulative distribution f” of x is 
FQ) = 1-7 ais 
’, Density f” of x is given by 
£Q) = F(x) =a) OW 
", x has exponential distribution with parameter 


¥ a,, 


12.6. Double Exponential or Laplace’s Distribution 
It is a continuous distribution given by 


1 |x-a| 
df = io {- | -wcx<e 
aa B 
where B > 0 
a, B are called parameters of the distribution. 
We have 
yw’ (0) = E(x’) 
1% _|x-a| 
=— f xe & dk 
2B -., 
te x-a 
UW — 
B y 


= 5 J @+py'e May 


0 ro) 
= ; J (a +By)’ e” dy + f (a+fy)’ e” dy 
—00 0 


For first integral, change y to —y 


= of +’c) a"? B72! + "cya * BA A+... 
= o +2! "cy a7 B2 +4! cy 4 BA +... 
Put r=1 
_ xX = pi (O)=a 
r=2 


Put 


es 
2 
roe) 
= f {a" + "cp a’~2 B? y+ "cy a4 Bt ye. } e” dy 
0 
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eS 


12 (0) = a 
= oe ms 


var (x) = Wp 


o- 
Ss 2p? 
12.7. Lognormal Distribution 

Let x be a positive random variat: 

y = log 

Then, ify is normal variate, x is ca 
lognormal distribution. 

If y is M(m, 0), dist. of y is 


dP = — 
On 
Dist. of x is given by 
dP = — 
On 
a, 


12.7.1. Moments, Mean and Variar 
Wr (O)ore = EC 
= Et 


n 
= @ 


x =p’, (0) = > 
w’, (0) = e 


var (x) = BW’; 


12.7.2. Theorem : 
If x1, X2, ..... X, be independent lo 
0; then | 


X= XY 
is also a lognormal variate. 
Proof: Let y; = log x; 
Then y; is M(m;, 0), V;- 
Let x=x 
. logx = lo 
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: -A, X 

ee On 

ux 
I 
‘ 

n 

*,-co<X< 00 

kk 

dy 


dy + J (a+By)’ e dy 
0 


dy + f (a+BPy)’ e~” ‘| 
0 


t+ By)"} e* dy 


‘y +loyol 4 Bye te? dy 


Tegal * Bt 4! + 
4! "Cg a4 Bt + 
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WW’, (0) = of + 2! c B’ 
= + 2p? 
var (x) = pty (0)— x” 
= o2 + 2p’- o2 
2p? 


12.7. Lognormal Distribution 
Let x be a positive random variate and 
y = log.x. 
Then, ify is normal variate, x is called lognormal variate and distribution of x is called 
lognormal distribution. 
If y is Mm, ©), dist. of y is 


,  -2(22y' 
dP = eek, ee 


Dist. of x is given by 


' ~1(teaeom) 
2 o 
= e d(log x 
dP a (log x) 
\ 1 (gsm! 
= e : ad —, x>0 
oVv2n 
12.7.1. Moments, Mean and Variance 
HW’, (O)orx = E(x’) 
| = Ee”) 
= Mo ("ory 
pape e? 
=. 2 2 
m+—o* 


x ='1(0) = e 
2 (0) = e m+ 202 
var (x) = pi’, (0)- x? 


2 2 
a ezmt2a _ g2mta 


. amo" (” -1} 
12.7.2. Theorem: 


IfX1, x2, ..... X, be independent lognormal variates such that log x; has mean m; and s.d., 
G; then _ ; 
X F XjX2 0. Xp 

is also a lognormal variate. 

Proof: Let y; = log x; 

Then y; is N(m;, 0), V;- 

Let X = Xp X70 Xp 

: logx = log x, + log x, +..... + log x,. 
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= VF V2 veers +n 
By additive property of normal variates, log x is a N.V. with mean 


n n 
> mj and s.d. pp oF 
i=] i=] 


x is lognormal variate. 


12.8. Cauchy Distribution 
It is a continuous distribution with probability differential 


igs 
ee 
r 


A; pt are the parameters. 


dx, ~0<x< 0, A>0 


Substituting z : P= z, (12.8-1) takes the form 


: 1 dz 
dP = irae i —o <Z< oo 
m 1+z 


(12.8-1) 


(12.8-2) 


This is standard cauchy distribution and z is called standard cauchy variate. 


Distribution function of (12.8-1) is 


F(x) 1 7 dx 
x)= -> 
1X “=14(4) 
x 
Put xTH = Zz 
rx 
x-y 
ao eae 
Ny 1+ 27 
1 17 
= - x 
— {tan z} 
tT x 2 


For various constants of (12.8-2) sec. Ex. 9-28. 
12.8.1. Moments , 


=H) = oo i xdx 
(x) 7 1X 0 4 (4) 
7 ae 
Put a = Z 
dk i (Az + p)dz 
ud 1427 


—-2@ 
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ee 


where 


=i 


sain 


Now. in general / does not exist 


exists and is zero. 


Lin 


ta 


In general, x does not e> 
(12.8-3) => x 


Put 


H2 


> 


= |. 


which does not exist as the integral 
[tz does not exist. Simila 

12.8.2. Median and Mode 
From (12.8-1) eq. of prob. der 


y 
x—H 
Put x 
y 
which is symmetrical about 
z=0 ie. 
median is 
Also obviously y is maximum 
, Mode is at z 
ie., x 


Mean, mode and media: 
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A 


1% 2d 1° 4 
ane wane = 


gx isa N.V with mean 


=—/]+p (12.8-3) 


fferential 


J zdz 

x, -o<x< 00, A>0 — ...(12.8-1) where [= 2 

Now. in general / does not exist but its principal value 
2 

Lin f 5 


pope le 


dz 


exists and is zero. 
In general, ¥ does not exit but if the principal value of / is taken 
(12.8-3) => xX =u. 
Wy = E(x-py 


'Z< 00 .(12.8-2) 


id standard cauchy variate. 


NY 
N 


Put =Z 


> 


which does not exist as the integral on right-hand side is not convergent. 
Ly does not exist. Similarly 1, (r >. 2) does not exist. 
12.8.2. Median and Mode 
From (12.8-1) eq. of prob. density curve is 


1 1 


Dee ne Ae, 2 
Tr a 
lew. 
nV 
Put ae = 
ae 
WZ ~ TA 1+22 
} which is symmetrical about 
z=0 ie. x= 
median is x= 


Also obviously y is maximum for z = 0 
: Mode is atz = 0 
L.e., x= 
. Mean, mode and median coincide. 
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12.8.3. Characteristic Function (e.f) 


From (12.8-2) c.f. is 
o(t) 


Consider 


I 


el 


ett 


Now, for (12.8-1) c.f. is 
10) 


Ece'”) 

1% 1 

i. oe ne dz changing z to—y 
or) 

1 f ~ity 1 

T J I+y 

fo 9) 

jf e” eMay 

-2 


J {00s (ty) + i sin ()} & dy 


~~ 


[-) 
2f cos (ty) e” dy 
0 
) | sin ty te" 
t 


+ a i} e~” sin (ty) dy 
o °hi5 


ele | e* onion 
t 
0 0 


ae ae 
or ~ a | © cos) ay 
0 


eae 
ane 
ae 
142? 
1 ~ity 2 
— é — 
2n 1+? a 
2 i SID oP 

2 
T l+t 
Liye iF 
S a ” re = o(4) interchanging ¢ and y 
E(e*) 
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nrc 


Put = Z 


12.8.4. Additive Property 
Let xj, x2, .... X, be independer 
(An bn) respectively. 


Then br; (t) 
x 


Let 
ox (t) = 


iil 


i 
a 


an | 
By uniqueness theorem o 
x is a Cauchy Variate 
with parameters (A; +.....+ An, Hy 


12.9. Truncated Distributions 


Let x be a random variate with 
Then density f” of x truncated 


Fil 
Ex. 12-2. Let x be normally a 
density of x on the left at ‘a’ and on 
distribution. Furthermore, ifa = 1- 
is m. % 
Sol. Dist. of x is 


dP 


f(x) 


and F(x) 
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changing z to —y 


ly 


(t»)} el dy 


~ 


z J e~” sin oa] 


0 


Ca eel 


f e-* cos ra 
0 


ty) dy 


dt 


‘ O(f) interchanging ¢ and y 


and FQ) = Tin f e? 
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a 


Put 


1 f oit(u+dz) dz 
2 
nN wy 1+z 
oO 
- etn 1 f elar, dz 


us 1+z? 


a) 
= gilt e Al es elt lel 


12.8.4. Additive Property 
Let x1, X2, .... X, be independent cauchy variates with parameters (Aj, [11), (Az, [12) «... 
(An Hn) respectively. 


Then (0) = elthi- Auld 
Let X= xXx t... +H 
ox () = Efe'™} 


Et elt . ela 3 olny 
= E(e!™), E(e!2) ... E(el) 
$5, Ox (Dds, 


= elt( Font Wy )—(Ay Feet Ag fel 


By uniqueness theorem of c.f.s. 
x is a Cauchy Variate 
with parameters (A; +.....+ Ag, Py to... pq) 


12.9. Truncated Distributions 


Let x be a random variate with density f" f(x) and cumulative distribution f” F(x). 
Then density f” of x truncated on the left at x = a and on the right at x = b is 


aot) ,asx <b. 
F(b)- F(a) 
Ex. 12-2. Let x be normally distributed with mean m and variance 6°. Truncate the 
density of x on the left at ‘a’ and on the right at ‘b’ and calculate the mean of the truncated 
distribution. Furthermore, ifa = \1—o and b = 1+ 6, then mean of the truncated distribution 


is m. 
Sol. Dist. of x is 2 
a=) 
dP = = e 2 o dx —00 <x <0 
ov2n 
\(zmy! 
2\ co 
= e 
F(x) ov2n 
i 7 Ee: 
o 
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lf{x-m e lfx—m 2 7 
1 2 il } dx 1 o -5{ } Ex. 12-3. [fx is a-N(m, 6), then 
F(b) - F(a) = fesse s fers 
oV2n ss oVv2n 1, -— be 
1fx-my 2 
eee als) 
Ose a way(5). 
x-m : 
Put ee Sol. Distribution of x is 
plea 12 
F(b) - F(a) = ES Gk cl (1) 
OIRO ig cn : aaa 
Density f” of truncated distribution is 1 
a xe Put _ 75 3 
F(b)— F(a) 
(scm) => x =m 
1 e 2\ 6 
_ ov2n d= 
F(b)- F(a) ee 
Distribution of z is 
and asxsb. o dP =e. 
". mean of the truncated dist. is where c is constant to be obtained s.1 
ioe) 
2 
1 = = 
eee: Sees ie ae) ss Lee 1 
ee) ov2n a cis given by 
x-m oo : 
mm o> cf err? dz = 1 
0 
b-m 1 > 
My ou ae 
= OO m+oy).e d cl = 1 
F(b)—F(a) Vin Jacy, et) - ue 2 
Go 
1 1 am 2 pom 2 L@. CS 
o ays oO o Sarre ’ 
= —— YI e? dy +—-|--__y e 2 dy 
F(b)— F(a) | Von is aan feos 
Distribution of z is 
b-m 
1 o -37 S dP = - 
= m+—— ———— 5-2 
V2n F(b)- F(a) <— 
o 
. ee 1 
foamy fie 2 which implies that z is a Y ry ‘ 
ck oe ai oa ena mle) Sue) 
J2n F(b)- F(a) Ex. 12-4. Ifx is a(A), find E ( 
variate. 
which lies between a and b Sol. Distribution of x is 


Ifa=m-—oandb=m+o then 
m-a = b-m=o 


dP = 


x = mM. 
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Ex. 12-3. [fx is a. N(m, 0), then. 


al 
2 


t 
isay 2 . 


Sol. Distribution of x is 


x- 
o 


my 


(xa) 
dP saa e c dx —co<x< © 
eS | 
ut z= 5 a 
=> x = m+ovV2z 
dz 
dx = i Gia 
2Vz 


Distribution of z is 


dP = ce? 72!" dz,0<z< 0 
where c is constant to be obtained s.t. 


fap=1 
0 
c is given by 


. ; 
cf e7z dz = 1 
0 


: (3) 
Le, cl 2 


Ul 
_ 


Distribution of z is 


dP = 


: 1 
which implies that z is a y (5). 


4 


TaN & 


zl? dz, 0<z< « 


Ex. 12-4, [fx isa yA), find E (vx ) . Deduce mean deviation about mean for a normal 


variate. 
Sol. Distribution of x is 


dP = 


1 


é 


(A) 


tT! dy, 0<x<0, 
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me T (a + ;| 
T() 2 
Deduction. Let x be a N (m, 0). 


Then z= 


Now | Ix—m| = of2z 
, Mean deviation about mean. 
= Elx-ml 


o 2.E (vz) 


= of, HO 
= O./—-: 


(3) 


Ex. 12-5. [fx and y are independent gamma variates, find the distribution of 


2 


(i) xty fr oe (iit) =. 

xt+y y 
Sol. Let x and y be gamma variates with parameters 4 and ut respectively. 
Then distribution of x and y are 


1 


Ze -x JA-1 <0 
dP T() e°.x” dx, O<x 
1 
dP = —— e” yo! dy, 0<y< oo 
T(u) 
Since x and y are independent, joint distribution of x and y is 
1 (x+y) JA-1 opel 
dP = ————-e 2x yh" dx- dy 
PA)T(u) 
P =xty, ve— 
ut u=xty, v= x+y 
Ou Ou 
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DiS ee 


Now x= uw 

Joint distribution of u and 
eet ee 
PAT (hu) 
eS 
P(A) (pw) 


dP = 


= 1 
~ | TA+p) 


=> uand vare independent var. 


To find dist. of as proceed as be 
y 


Put u= x: 

e 

O(u,v) _ iC 

Axy) | é 

¢ 

t 

Now x= T. 
O(u, Vv) 
O(x,y) 


.. Joint distribution of u and vi 
fs 
—————- e 
P(A) (uy) 
= = oe ee .€ 
PA)r(H) 


dP = 


= J e! 
TA+p) 
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1 1 
ol dx = y.- -Xx 
(x+y)? (x+y)? 
3}-1 ai, (x+y) tae pik 
2 dx (x+y)? 7 x+y u 
Now x = uv, y=u(1—-yv) 


Joint distribution of u and v is 


dP = eSTeTT) e" (uy)*" {u(1 = y)}to! u. du dv 


I 


te A es Aeund ~-1 _ tol 
Far ° (u) ve Ng du dv 


= 1 —u  At+p-1 1. Aly a} 
(gine wu ai} {oe (l-v) | v 


=> uand vare independent variates. u isa (A+ pL) variate and v is B,(A, L4) variate. 


To find dist. of a proceed as below: 
y 


Put u=xty, me 
y 
ou u 
Seg d(u,v) ax ay 
ates, find the distribution of Bxy) ~ 1a av 
= Ox Oy 
y 
ors X and wl respectively. 1 1 
= |1 x 
yoy" 
7 O0<x< co 
x+y 
at aoe od 
‘ O<y< °° y 
N eS a 
fx and y is ne a Gage bay 
xh yl a. dy Che (1+)? 
0(x,y) u 


.. Joint distribution of u and v is 


A-1 pu-l 
dP = ae e” al pane eeeey du , ane 
TA)T(y) (4y l+y (1+)? 
1 A-1 


“Qn Gaga ee 


1 -uo) At+u-l 1 wee 
= eu du ———.—- dv 
T(A+n) BA,H) (1+ v)At™ 
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=  uand vare independent variates 
uisa "A+ UW) variate 


and vis By (A, [4) variate. : 
Ex. 12-6. [fx andy are two independent standard normal variates, find the distributions 
x? : 
of (a) x (ii) r+y (iii) =. 
y 
Sol. (i) Put u=x 


Then : uisay (5) variate. 
2 2 


Dist. of a u is 
2 


which gives the distribution of u. 


ee ane (5) ; 
i — x isay|— | variate 
(ii) SRS 


l o53 (5) : 
— y isay| — | variate 
2 2 
| ra eee ee E 1 } ; 
—(X*+ is a —+—=1] variate. 
ra ¥) mer 
Dad 
Dist. of = = is 
. Po weer ee Hae 6 ae ee ee 
~Z(x"+y") 
Abie eo eno Ae aie | eae 
r() 2 2 
2 
aay 
= ze dw? 
where Ww = x+y’ 
wine Age, | . 
ul —-x 8a — } variate 
(i). (5 
u Pisa (5) variate 
pe 
15 
_ 3% ae iA 
u = ear 2 isa Bo 29 


Ex. 12-7. [fx and y are gamma variates with parameter Xand \. Find the distributions 1 


ofx+yand = 
x+y 
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Sol. Let u=xt+yv = = 
X- 

u( 

x= 

a 

Axy) 16 

uv) | a 

] 

d 

=| 


Now distributions of x and 


dP = — 
dP = = 


The joint distribution of x and yi 


dP = Tr 
The joint distribution of u and v1 
1 
dP = ———— e" 
P(A). Pu) 


T(A +p) 


1 
Faue B(A,H 


.. wand v are independent and th 


dP = ~ 
dP == 


Ex. 12.8. Let x be a By (4, B), fi 
Sol. Dist. of x is 
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dnormal variates, find the distributions 


‘ameter Nand w. Find the distributions} 
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av : 


_ u(l-v) 
es 
us 
7 eI 
ul 2 
2 

Kl 

pl 
Sa 2 


Sol. Let u=xtyv = —— 
x+y 
_ u(l+yv) 
eas 
: Ox Ox 
OY) _ | au 
REY NO Oy 
Ou ov 
Ley 
em 2 
l-y 
2 
Now distributions of x and y are 
1 -x 
= ——e 
err 
dP = oo 
Tu) 


The joint distribution of x and y is 


1 


dP. = 

P(A).P(u) 

The joint distribution of u and v is 
A-1 
ape =_ on jac} [Mou 
T(A) . F(u) 2 2 
= : et gh teat 
T(A+p) 


1 
Fae B(A, 1) 


e FAY) yA-T HHT iy dy, 


p-l 
} .dudv. = 
2 


dav evi wh 


.. wand v are independent and their distributions are 


dP = 


1 


T(A+p) 


dP = 


1 


2**H-l Bau) 


-u yrthol du 


Ex. 12.8. Let x be a By (A, UW), find E(vx). 


Sol. Dist. of x is 


dP 


= 1 
BA, H) 


xl -x dx, O<x<1 
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at+vt-'d yw! dy respectively. 
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oe 
E(Vx) = ar ie x2. x47! Qaxytol dx 
1 l nett 
= fx 2 d-xtl dk 
sd 0 


as 1 
ra+s}raew +5)ro+w 


rayr(s +u+ rayr(.+u+t) 


Ex. 12-9. Ifx and y be independent standard cauchy variates, find the dist. of xy. 


Sol. The joint dist. of x and y is 


1 
Fe caecee | Loca eer 
Tm l+x mT 1t+y 


dx dy 


m2 (1+x2)(1+y”) 
Put u=xy, v=y 
Both u, v vary from — © to + 0 


u 
x =-,y=v 
1 - 
bey 
as a a bam 
0 1 s 
Joint dist. of u, v is 
Gre a es 
u? lv 
Jae 
v 
1 |v| 


Ste did 
we (w+ y\(l+v) 


Dist. of u is 
dP = “ft. 2 ee 2 
T (u* +v*)(1+v") 
2 age vdv 
0 (w+v?)(1+v") 
Put ee 


saad | 2 oo 
ne 70 (u? +2)(1+2) 


SPECIAL CONTINUOUS DISTRIBUTIONS 
ee 


_ fu 
n 
oe 
oe? 
= 3 
1. If x), x2, ..... X, are gamma vai 
om Xp tXQ+..... +Xp 
n 
2. Fora Beta distribution of first k 
(ij) A.M. >H.M. 
ii) log G = ———~- — ‘ 
(ii) log ve u) oR o Ba 
3. IfX), Xo, .... X, are independent 
Y, = Ma 
Zn = Mi: 
Then c.d.f. « 
Fy, (vy) = {F. 
FE AVY Le 
respectively. 
Sol. (i) Fyy): = PR 
= PL 
Ul 
7 i=l 
a 
: i=] 
(ii) Fz, (y) = Pt 
=> 1 _ 
= |- 
=> 1 _— 
= ]- 
= 1 _ 
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Tl qaxyhol dy 

I -1 
(1-x)4™" dx 

“9 4) > 
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—_— 


du;o 1 1 1 
n if asset | 


» du re ur +t ‘s 
me(l-w2)| | 140 
0 
du 2 
‘i ie log u | 
ee ae log |u| 
n? (u? -1) 
EXERCISES 


« If x1, X92, ..... > Xn are gamma variates each with parameter A, find the distribution of 


. For a Beta distribution of first kind, show that 


() AM.>HM. 


(it) log G = o Ba, H). 


7 1) or 


3. IfX1, Xo, ..... X, are independent random variables with same c.d.f. Fy (.) let 


Y, = Max {X, X9, ..... Xn} 
= Min {X1, Xo, .....X,} 
Then c.d.f. of Y, and Z,, are given by 
Fy, 0) = {Fx(y)}" 
Fz,(y) = 1- {1-Fr()}" 
respectively. 


Sol. (i) Fy (vy) = P{¥, Sy} 
= PEX Sy; X2 SVyy es Xn SY} 
nt 


= I P(X; S y) (CL X sare independent) 


= Hl Fy (y) = {Fx ()}" 
(ii) Fz, (vy) = P{Z,< y} 


= 1—-P(Z,>y) 


N 


= 1-[] pcx;>y 
f=] 
= 1-I] {1-P(X; < yy} 


ae 1 {1-Fx(y)} 


= = a egipi 
QO00 


{13 


Correlation Co-efficient and Linear 
Regression 


13.1. Introduction 

In bivariate distributions there are two variates x and y. If the change in one affects the 
change in other, the variables are said to be correlated. Otherwise they are said to be 
uncorrelated. If the increase (or decrease) in one results the increase (or decrease) in other, 
the correlation is said to be positive otherwise negative. 


In bivariate distribution the data is of the form (x1, y1), (X2, ¥2) ---- Xn Yn). Each of these a 


value pairs can be represented by a point in xy-plane. The resulting set of points is called a 
scatter diagram. From the scatter diagram, one can get a fairly good idea, though vague, 
about the correlation between the variables. If the points are very dense (i.e., very close to 
each other) a fairly good amount of correlation is expected and if the points are widely 
scattered correlation is poor. 

As a measure of degree of linear relationship between the variates, co-efficient of 
correlation is defined. This formula is referred to as product-moment formula for linear 
correlation. It, being due to Karl Pearson, is sometimes called Karl Pearson’s correlation 
co-efficient. 


13.2. Covariance 
Covariance between two variates x and y is defined to be 


E(x—x)(y-y) (for prob. dist.) 
oF 5 3 fi i -*)Oi-Y) (for freq. dist.) 
i=l 


“where X and y are respective means and is denoted by ‘cov (x, y)’. 
Correlation co-efficient. Correlation co-efficient between two variates x and y is 
defined to be 


fees) ae 
(s.d.of x)(s.d.of y) 


and is denoted by ry, OF Px, y 


Sie zy > 
For freq. dist. Ty ese 


Spey - x (Sf) (2H) 


{za?- 5 c?} 18-5 cy} 


and for prob. dist. 


CORRELATION CO-EFFICIENT AND LINE. 
Se ee Ee 


13.2-1. Properties of Correlation C: 
(i) The correlation co-efficient is 
Sol. Let x and y be two variates - 

correlation co-efficient between them 
The transformations correspondit 

ae 
h 
where u and v are the variates to which. 
to the change of origin and / and & ar. 
Now x = at 

, x = at 


u = 


where i, v are expected values of u, 1 


Now Ty = Vi 


Ak 
\hh 
according as h and k are of same or 0} 
‘ Ir yl = Iruy 
(ii) For two independent variate: 


Sol. Let x and y be two independe: 


Now cov(x, y) = E{l 
= Eft 
= EV. 
= E 

Since x and y are independent, 

Etxy) = EQ 


cov (x,y) = x: 
ry = 0. 
Converse. It is not necessary that 
consider the following example : 


ba -3 —2 
y=x: 9 4 
xy: —27 -8 


Here Xx = 0 = XLxy 


ent and Linear 
on 


c and y. If the change in one affects the 
slated. Otherwise they are said to be 
ults the increase (or decrease) in other, 
ive. 
1, Y1)s (X2, V2) «++» (Xns Yn). Each of these 
2, The resulting set of points is called a 
get a fairly good idea, though vague, 
oints are very dense (i.e., very close to 
expected and if the points are widely 


: between the variates, co-efficient of 


as product-moment formula for linear 
imes called Karl Pearson’s correlation 


fined to be 
(for prob. dist.) 


(for freq. dist.) 


d by ‘cov (x, y)’. 


cient between two variates x and yis 4 


-j) 
(v-3y" 


: x (She) (SH) 


so} Lag? - 2 cp} 
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_  Ee-DO-7) 
VE(x-%) E(y-3)? 
E(xy) — E(x) E(y) 
MEQ)? (E071 EO)? —{E0)}7] 


13.2-1. Properties of Correlation Co-efficient 
(i) The correlation co-efficient is numerically independent of origin and scale. 


Sol. Let x and y be two variates with expected values ¥ and y respectively and r the 
correlation co-efficient between them. 
The transformations corresponding to change of origin and scale are 
x-a -b 
u= and vy = = 


h k 
where wu and v are the variates to which x and y transform, a and b are constants corresponding 
to the change of origin and / and k are constants corresponding to change of scale. 
Now x = atuh and y=b+k 
‘ X = at+uhand y=b+khv 
where i, Vv. are expected values of u, v respectively. 
Ei{@—-x)(y~Y) AkE {(u~u)(v~V) 
new 1 eer, en) eb g lve Sane ese 
E(x—-x)" E(y-y) h* k* E(u-u)*. E(v-v) 
hk 
|Ak| Tuy 
according as h and k are of same or opposite signs. 
: Irxy| = rw. 


(ii) For two independent variates correlation co-efficient is zero. 


=+ry 


Sol. Let x and y be two independent variates with expected values x and jy respectively. 
Now cov(x, y) = E{(x- x) (y-y)} 
= Etxy—xy~yx+x-y} 
= E(xy)-x E(y)-yE(x)+x-y 
E(xy)-x-y-xX-y+xX-y = E(xy)-x-y 
Since x and y are independent, 
E(xy) = E(x) Ey) =%-y 
cov (x,y) = *-y—-xX-y=0 
Ny = 0. 


Converse. It is not necessary that, ifr = 0, the variates are independent. To observe this 


consider the following example : 


x: -3 =3 = 0 1 2 3 
jon 9 4 1 1) 1 
xy: -27 -8 -1 0 1 8 27 


Here 2x =0 = Ixy 
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Say - x (2x)(Zy) 
= _e  -S 0 


om Me 2 ae 2 
(2x*) ac [By yee? 


(iii) The correlation coefficient between linearly related variables is ‘+ 1° or ‘- 1. 

Sol. Let x and y be the variates related by the equation y= mx+c and x,y be their 
expected values. 

Then y 


Vy 


mx +c 
_ _F(e-¥)(y- 9) mE(x- 5° 
E(x-x EQ-yy Vin? .E(x-®) 


Now 


m 
|m| 


according as m is positive or negative. 
(iv) The correlation co-efficient cannot numerically exceed unity. 


Sol. Let x and y be the variates with expected values x and y respectively. 


Let x’ = x-x and y=y-yp 

Now for any real constant ‘a’, (ax’ ~ y’)’ 2 0 

Since probabilities are non-negative and the sum of the non-negative quantities is non- 
negative, 


Eax’-y'P = LX pilax'i-yiy 20 


I 


a’ E(x’*) + E(y”) — 2aE(x’ y’) 20 


Put a= au 
E(x'~) 
an . Ey? 
Then EQ”) 2 ~E(r'2) 
2 
E(x'y’) E(x-x)(x-y) 2 


=Hlry 


Le, 12 pearl = 


rol S 1 or -1 Sry 1. 


VE(x-3)° E(y-y 
Ex. 13-1. Show that, ifx’, y’ are the deviations of the variables x, y from their means, 


2 
— 4 palate (Symbols have their usual meanings) 
F GO, Gy 


and 


Deduce that -1< r < 1. 


f a 
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CORRELATION CO-EFFICIENT AND LINE 

——————S oe 
: 1 

Sol. Consider 1 -— > fil- 

2N F ( 


1- 


u 


= as. 


Similarly second result can be p1 


l-r =r 
Now I >. 
: ieee 
an r ms 
-1s 


Ex. 13-2. Show that the co-effici 
expressed in the form 


o,6 


where x, y are A. Ms. and oy, Oy are 
Sol. By def. 


My 


o 
Note. For the numerical data r. 
new variates u and v defined by 
x 
eee 
where a, b, h and k are constants to b 
using the fact ryy = = Tey. 


MATHEMATICAL STATISTICS 


1 
y ee) 


2 2 1 2 
Ly. -—( 
] By raced 


: related variables is ‘+ 1° or ‘—1’. 
juation y= mx +c and x,y be their 


a mE(x-x)? 


“Hh BGES 


ally exceed unity. 


tlues x and y respectively. 


- 


of the non-negative quantities is non- 


>0 

2 
(x-y) og 
ap ary 
EY-y) 


of the variables x, y from their means, 4 


's have their usual meanings) 
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A 


Zz 
Sol. Consider 1 —- — 2 fi [= * 
Oo, 


2 
12 
= ane xi +i - Biot Xi Yi 
oN = a Oy, Oy 
-1-de (FE we)-e [te si 
20 a 20 vie 
me 
Nm 
ie 7 2, cov(x, ¥) 
— ; 7 s Oy Oy 
Similarly second result can be proved. 
1 2 
xi di 
bers 2 ee 2 
Now 1 1E (2 2| 0 => 12r 
1 £ , 2 
x; ° 
and eee EH [o dt) >0 => r>-l 
Ny Oy, Gy 


-Ilsrsl. 


Ex. 13-2. Show that the co-efficient of correlation between two variates x and y may be 
expressed in the form 


3 (v9) 


0, Oy 
where x, y are A. Ms. and 6,, Oy are s.d.5. 
Sol. By def. 1 
cov(x,y) x a y (x-x)(y-y) 
(s.d.of x)(s.d.of y) 0, Oy 


1 Sore cak vats 
{yy - yx + FF} 
Oy : 
= ey xy—nx-y — nx-y + nx-y} 
Oy 


= d {Ay 9-35} 


Oxo, ln 
Note. For the.numerical data r,y is calculated by changing the variates x and y to the 
new variates u and v defined by 


u= = Poy eee ae 
h k 


where a, b, h and k are constants to be chosen suitably so as to simplify the calculations and 
using the fact ryy = + Ny. 
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SSS ee phen 


Ex. 13-4. Find the co-efficient « 


Ex. 13-3. The ages (x) and systolic blood pressures (y) of 12 women are given below: + 


Ages in years (x) Blood pressure (y) 
56 147 
42 125 
72 160 
36 118 
63 149 
47 ; 128 
55 150 
49 145 
38 115 
42 - 140 Let pest 
68 152 
60 155 and v= 2 


Calculate the correlation co-efficient between x and y. Sol. Calcula 
Sol. Define the new variates 
u=x—52, and v=y-140. | 


Then we have the table of values as below : 


x y u v u v uv 
56 147 4 7 16 49 28 
42 125 -10 —15 100 - 225 150 
72 160 20 20 400 400 400 
36 118 - 16 —22 256 484 352 
63 149 11 9 121 81 99 
47 128 -5 -12 25 144 60 
55 150 3 10 9 100 30 
49 145 ~3 5 9 25 -15 
38 115 -14 -25 196 625 350 
42 140 -10 0 100 0 0 
68 152 16 12 256 144 192 
60 155 8 15 64 225 120 

4 4 1552 2502 1766 


1 
1766 - —(4)(4) 


— ; = 0-896. 
1552 - —(4)* ,[2502-— 4)? 
\ 12! y 12‘ ) 
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(y) of 12 women are given below : 


Ex. 13-4. Find the co-efficient of correlation for the following table : 


od pressure (y) 
147 
125 
160 
118 
149 
128 
150 
145 
115 
140 
152 
155 and v= = 
dv. Sol. Calculation of Co-eff. of Correlation. 
w v uv 
16 49 28 
106 225 150 
400 400 400 
256 484 352 
121 81 99 
25 144 60 
9 100 30 
9 25 -15 
196 625 350 
100 0 0 
256 144 192 
64 225 120 
1552 2502 1766 
(4)(4) 


_—<————_———————— 0-896. 


[905 
2502 -— (4 
oa ) 
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Se ge ar ee Se ee ee pele ca cc Ne De 


1 
1 aT cae 28) 


SS Ee 
,|210-— 54)? _ [286 - La Rye 
100 | ) i00 § ) 


Ex. 13-5. A computer while calculating ry from 25 pairs of observation obtained the 
following constants : 


n=25, Lx=125, Lx’ = 650, Sy = 100, Ly’ = 460, Lxy = 508. 


A recheck showed that he had copied down two pairs (6, 14), (8, 6) while the correct 
values were (8, 12), (6, 8). Obtain the correct value of the correlation co-efficient. 


Sol. n = 25 

Correct Value of Xx = 125-6-8+8+6= 125 

Correct Value of Xx? = 650 — 36 — 64 + 64 + 36 = 650 
Correct Value of Zy = 100 - 14-6 +12 +8 = 100. 
Correct Value of Zy* = 460 — 196 — 36 + 144 + 64 = 436 


Corrected Value of Lxy = 508 — 84 — 48 + 96 + 48 = 520 
.. Correct value of co-efficient of correlation 


n(Zxy) — (Zx)(Zy) 


\ndx? -(Ex)? Yndy? -(Ey)? 
(25)(520)-(125)(100) 


‘ (25)(650)—(125)2. (25)(436)—(100) 


r= 


Total 


520 — 500 20 2 


4/650 — 625 [436-400 5-6 3. 


Ex. 13-6. The following table gives the number of students having different hts. and 
wes : : 


Calculation of Co-eff. of Correlation. 


wts. in lbs. 


o;n 
fon) 
2 
N 
= 
o 
N 
ro 
2 
= 
~~ 
a 
Oo 
o 
a) 
S 
o 
i 
fw) 
nN 
fw) 
a 
oo 


hts. in inches 


Calculate co-efficient of correlation between hts. and wts. 


Sol. Let x and y be the variates for the wts. and hts. respectively. 
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54)(—28) 
wo = 0:58, 


1 2 
a a (=28 
286 joo" ) 


pairs of observation obtained the 


= 460, Ixy = 508. ; 
irs (6, 14), (8, 6) while the correct & s Ee > 
he correlation co-efficient. 
; 
150 
. 1H | & 
= 436 f . 
= 520 
g é — a a 
(Zy) 3 
12 2 he 
y —(Zy) é R a 
roa = 
~(125)(100) : 
| 2 
(25)(436)—(100) 3 : : = 
20 2 s | | ~ S 
“400 5-6 3 g 
= ° 
if students having different hts. and F ~ 
& — 
Ww 


0 
10 
5 


Oo 
fon) 
a 
N 
tm 
i) 
N 
or 
i=) 
_ 
_ 
Oo 
= 
a 
[=] 
=) 
_ 
cm) 
Oo 
af 
oS 
[=] 
eb 
foe} 


Dy 
i) ° 
: 
[2] 
oO 
a 
6 
= nN 
g 
Total a en 
Mo} 
=| 
ts] 
18 g - 7 
haa N 
oO 
27 Ss 
= 
& 
Hh ioe) —_ 
- 2 | | 
‘§ 
20 S 
a 
100 2 a wn 
(0) a x 
oO Q c 
a 
and wts. | 
ss] 
R ee = 
o a) Nol 
— | | 
Oo a) 
ms 4 i 
i—) 
77) 
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1 
25-<— (37)(14) 


r= Wns i ar =. ae = 0-09. 
[123-— 37 (404 - —— 14 
100‘ ) 100 ‘ ) 


Ex. 13-7. Find the variance of the variate 
Ue = AX, + 2X2 +. TAX y 
in terms of variances of x), x2 etc. 
Sol. Let i, x}, X, etc., be the expected values of the variates. 
Then u = E(u) = E(ayx, + aox2 + ..... + ApXn) 
ie., = aX, + 42X24 ...... + AnXy 
Var(u) = Efu—u}? = Efay(x, — %)+ ay (x2 —¥p) +. + An On — Eq}? 
= EfaP(xy — %)° + 43 (2p)? +2 + a2 (tq — Fy)? 
+ 2ayaq (x; — X})(%2 —X2) + } 
= af E(x, -%1)° +3 E(x) -%))’ +... 
+ dy? En — ¥q)° +20 ,4y E(x, — ¥) (Xz — Fy) t eves 
= ay” var (x1) + ay” var (x2) +... + ay? var (Xp) 
+ 2a;a7 Cov (x1, X2) +... 


(where a’s are constants) 


Ex. 13-8. if 3%, os and oF! y be the variances of x, y and x — y respectively, show 
that 


02402 -62., 
lb eon os 
a 20, Sy 


Sol. 7-3 = var (x-y) = var (x) + var (y) — 2 cov (x, y) 


o2 + o; — 2rpy Ox Dy 


o + oF - 62: y 
ry = . 
20, Sy 
Ex. 13-9, Find the correlation co-efficient between x and a — x. 
Sol. Let u=a-x 
Then uw = a-Xx 


where u, x are expected values 
: var (u) = E(u- i)? =E(x-x)* = 0° (say) 
cov (x, u) = E{(x- %) (u-i)} = E(x-¥)? =- 0 


-9O 


hy = TT =-1 
id 6,6 
Ex. 13-10. Find the correlation co-efficient between x + y and x — y (it is given that x 
and y are uncorrelated). 


Sol. Let u 


x+y, v=x-y 
Then u= X+y,v=x-y 

where a, v etc. are A. Ms. 
, cov (u,v) = 


E{(u-u)(v-v)} 
E{{@- x)+-Y)H{@-xX)-O-Y)}] 


CORRELATION CO-EFFICIENT AND I 
te 


where O;, Oy etc., are s.ds. 


Also var(u) = 
Leé., | a - 
and a= 


If r be the correlation c 


r= 
Ex. 13-11. [fx andy are two ¢ 


co-efficient r, show that the correi 


Sol. Letu=x+yando the 


Then uo= 

where u,x,y are A.Ms. 
Now var(u) = 
cov (u, x) = 


Correlation co-efficien 


Tux 


Ex. 13-12. [f x1, x. and x3 be 


' the correlation co-efficient betwe 


Sol. Let u and v be the Al 


Then u= 
where X,, Xz etc., are A.Ms. of x 
Now var(u) = 
where var (x;) = 
- Similarly var (v) = 
cov (u, v) = 


... Correlation co-effcient be 


r= 


Ex. 13-13. Two variates x. 
correlation. Show that 

U=xcosat+ysino¢ 

have the same variance 0° and z 
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(37)(14) 
= =0.09, 
404 - — (4)? 


(where a’ are constants) 
if the variates. 


) 


Xn 5 

)+ az (x2 —X2) ++ ay (Xn - Xn} 
= \2 2 = \2 

—¥2)° +... + Gy Xy_—-Xy) 


» E(x, — X)(%2 Xz) + 


+ sae $y? Var (Xn) 


s of x, y and x — y respectively, show 


2 cov (x, y) 


Oy 
eenxanda-—x. 


x)? =o (say) 
= -E(x-x) =-0 


ween x +vandx—y (it is given that x 4 


Y)}(x-X)-O-¥)F] 
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i 


E{(x-X) -(y-9)"} = 6} - 6} 


where 6, Oy etc., are s.ds. 


Also var(u) = var (x) + var (y) + 2 cov (x, y) 
ie, 6, = o,f +4, 
and 6,” = of +6, 
If r be the correlation co-efficient between u and v, 
o% - oF 
o2 + oF , 


Ex. 13-11. /fx and y are two correlated variables with the same s.d. and the correlation 
l+r-: 
co-efficient r, show that the correlation co-efficient between x and x + y is ‘es : 


Sol. Letu=x+yando the s.d. of x or y. 


Then ueo= X+¥V 
where u,x,y are A.Ms. 
Now var(u) = var (x) + var (y) + 2 cov (, y) = 207 + 2ro” 
= 20° (1+7r) 


cov (u,x) = E{(u-u)(x-x)J=EL{(x-x)+(y-y)}(x-X)] 
= E(x-x)? +E{(x-X)(y-j)} = 07 + cov (x, y) 
=o (1+r 


Correlation co-efficient between u and x is given by 


ov (+r) _ fier 


tux = = 
“6? fier) V2 
Ex. 13-12. [fx;, x2 and x3 be uncorrelated variables each having the same s.d., obtain 
the correlation co-efficient between u =x, + x2 and v= x2 + X3. 
Sol. Let a and v be the A.Ms. of u and v respectively. 


Then “= x] +Xy and y = X +X4 
where X,, X2 efc., are A.Ms. of x; and x; efc., respectively. 
Now var(u) = var (x;) + var (x2) = 15° 
where var (x}) = O° etc. 
Similarly var (v) = 207 
cov (u,v) = E{(u—w) (v— ¥)}= EL (4, - ¥1) + (42 ~F2) (42 — ¥2) 
+(x3-23)}] 
= E(x, -X 7 (°.” cov (x1, x3) =0 etc., 
, as Xj, X2, X3 are uncorrelated). 
= 6 
.. Correlation co-effcient between u and v is given by 
eee 
2022 


Ex. 13-13. Two variates x and y have zero means, the same variance o° and zero 
correlation. Show that 
U=xcosat+ysino and V=xsina—ycos a 
have the same variance 6° and zero correlation. 
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Sol. Since x and y have zero correlation, cov (x, y)=0 ie., E(xy) =0. 
Now var(U) = cos” o var(x) + sin’ o var(y) =o 
var (V) = sin’ o var (x) + cos” o var (y) = 07 


cov(U, V) = E[((U- U)\(V-F)] 


Now U = cosax+sinay=0 
and V = X sina - ycosa=0 
x cov (U,V) = E(UV)=E{(x cosaty sin a) (x sina—y cos a)} 
= cos asin o E(x’) — sin a cos & E(y*) 
= cos asinao’-sinacos ac’ =0. 
Ex. 13-14. [fu = ax + by; v= cx + dy, show that 
var(u) cov(u,v) a bi var(x) cov(x,y) 


cd 
Sol. u=ax+by and v=cxt+dy 


cov(u,v) — var(v) cov(x,y)  var(y) 
=> u=axt+by and v=cx+dy 
u-u=a(x—-x)+b(y-y) and v—-V=c(x-—Xx)+d(y-y) 
o2 = var(u)=E{a(x-X)+d(y-y)}" 
= @o2+h? Oy + 2ab cov (x, y) 


o? = var(v) 


= ?oft+d 6,° + 2cd cov (x, y) 

cov (u,v) = E(u-u)(v-¥v) 
= Efa(x-x)+b(y-y)}{c(x-x)+d(y-y)} 
= aco, + bd oy + (ad + be) cov (x, y) 


ies o cov (u, v) 
*S cov(u,v) oF 
2 of + bo} +2abcov(x, y) eos saa +(ad + bc) cov(x, y) 
ac o + bdo? +(ad + bc) cov(x, y) ce ot ee he oF + 2cd cov(x,y) 
a 
Apply Ry Ri Ke 
Cc 
d 
vee ae “ME (bo? + acov(x,y)} noe PCT A (do? +.c.cov(x,y)} 
aco? + bdo? +(ad + bc) cov(x, y) co a +d? 6? y +2ced cov(x, y) 
bc—ad bo +acov(x,y) doy +c.cov(x, y) 
~  ¢ ac o + bdo} +(ad + bc) cov(x, y) co aed ot ) +2cd cov(x, y) 
Apply Ri > R,-d. R 
_ bc-ad bo} +acov(x,y) de; +c.cov(x,y) 


co cfacz +bcov(x,y)} c{co2 +dcov(x,y)} 


CORRELATION CO-EFFICIENT AND LIT 


b oF + 

= (bc - ad) 5 
aoy + 
Apply C, - 
(ad-£ 
(bc — ad) G d 

(bc-—ad) 
d 

Apply C2 - 
a tKs Snap COV 
hewiad) ov: 

= (be-ad)’ Ay. 


Ex. 13-15. [fu =ax + byand: 
respective means and if the co-effic. 
uncorrelated, show that 


6, .O, = ¢ 
where Oy, Oy etc., are s.d. of u. Vv. et 
Sol. Now 6, = Vv 


V 
6,” = Vv 

cov (u,v) = £ 
G 

( 


6,7 6,7 — cov’ (u, v) 


=A 

=i 

But cov(u,v) = C 
6,0, = ( 


Ex. 13-16. x1, x2 are two varia. 
correlation co-efficient between the 
are independent of p such that x, + 


Sol. Let u=x,te 
: uo= xX;+e 
cov (u,v) = Ef{(u 

= E{(xy 

= E(x - 


= oy + 


MATHEMATICAL STATISTICS CORRELATION CO-EFFICIENT AND LINEAR REGRESSION 517 
a a a he 


=0 ie, E(xy) =0. 
x var(y) = o 
a var (y) = 


boy +acov(x, y) doy +c.cov(x, y) 


(bc — ad) 


‘ac + bcov(x,y) cot +dcov(x,y) 
b 
Apply i a OF “a C2 


(edie) cov(x, y) do? +c.cov(x,y) 
= (bc — ad) (ad — bc) 


d 


+y sin Q) (x sin ®—y cos G)} 
sin & cos o E(y’) 


x cos & GO” = 0 oF co2 +d.cov(x,y) 


Bee (bc—ad)? | cov(x,y) do; +¢.cov(x, y) 
cov(x,y) d o co? +d.cov(x,y) 
var (y) Apply CGE, 
per ee ad)? cov(x, y) af 
ee of — cov(x,y) 
-¥)+dQ-y 
2 
ie 2 OY cov (x,y) 
i Cs b mars 
; Cee cov(x, y) oF 
= (be-ady A,,. 
Ex. 13-15. [fu = ax + by and v = bx — ay, where x and y represent deviations Jrom the 
) respective means and if the co-efficient of correlation between x and y is r and u and v are 
uncorrelated, show that 
ee ee = 
)+dy-y)} O,.O, = (a +b) 6,0, yl-r*. 
Gy) where 6,, Oy etc., are s.d. of u. v. etc. 
ME Sol. Now 6,2 = var (u) =a 62 +B? oy + 2ab cov (x, y) 


6,” = var (v) =B’ 62+ a oy — 2ab cov (x, y) 


cov (u,v) = E{(ax + by) (bx — ay)} 
= ab (6, - 6,”) + (b? - a’) cov (x, y) 


Pay a Be era ey ee i ae 
aco% + bday +(ad + bc) cov(x, y) 6," Gy — cov’ (u, v) = (a +b’) {0,° 6," — cov «») 
cot +d? ay +2cd cov(x, y) = (#+bY 626, {1- oT G y) 
Ox Oy 


(a2 + BY 6, 6, (1-7) 
But cov (u,v) = 0 


6,0, = (a +b’) oy y1-r?. 


Ex. 13-16. x,, x2 are two variates with variances 6," and 6,” respectively and p is the 
correlation co-efficient between them. Determine the values of the constants a and b which 


bona {do} +c.cov(x,y)} 
c 
2 o? +d? 3} +2ced cov(x, y) 


2 are independent of p such that x, + axz and x, + bx, are uncorrelated. 
do}, +c.cov(x, y) 
22 "5 2 : Sol. Let u= x,;+ax2 and v=x,+ bx. 
) cox +d" oy +2cdcov(x,y) F ae Bie ade = 


cov (u,v) = E{(u-u)(v—v)} 

EX (x1 — X1)+a(x2 —))} {(%1 — Xy) + (xz -X2)} 

= E(x, — ))? +(a+b) E{(x,-%,) (x) — ¥,)} 4+ abE (xy -%,)" 
oO + (a + b) cov (x, x2) + abo,” 


‘i +c.cov(x, y) 


i +dcov(x,y)} 
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= 6)’ + abo,’ + (a+b) poio2 
Now cov (u,v) = 
(6/7 + abo,’) + (a+b) poo, =0 
Since a and b are independent of p, a and b are given by 
6,’ + abo,’ = 0 


and at+tb=0 
o 
a= ~p=—t. 
62 


Ex. 13-17. Jf x and y are two variates each with mean zero and variance unity and ry 
=r(#-1), find ‘b’ so that ‘x + y’ and ‘x + by’ may be uncorrelated. 
Sol. Let u=xty and v=x+t+by 
Then uw = x+y =0 and v=x+by=0 
, 0=cov(u, v) = E(xt+y) (x+ by) 
= E(x) + (1+) E(xy) + bE(y’) 
= (1+5)(1 +r) 
ae 1+b=0 as r#-1. 
=> b= -1. 
Ex. 13-18. [fx and y are independent random variates, show that 
Hxty,x-y=P (x xty)-P x ty) 
where r(x + y, x —y) denotes the co-efficient of correlation between x + y and x — y. 
Sol. Since x and y are independent, 


cov (x, y) = 0 (1) 
Put X.= xty, Y=x-y 
: var (X) = var(x) + var(y) + 2 cov (x, y) 
= 0,’ +6, 
and var (Y) = var(x) + vary) - 2 cov (x, y) 
= 6, +0, 


Now X = x+y, Y=x-j 

cov(X, Y) = E{X- X}{Y-¥} 
= E{(x- X)+(y-J)}{(x-¥)-0-9)} 
= He xy -(y-y)’} 


wy) cash 
WXX,Y) = aS 
of +07 


cov (x, X) = E(x—X%)(X-X) 
= E(x—x){(x-x)+(-y)} 
= E(x— x) +E(x-X)-(y-¥) 
= 6, +cov (x, y) 
| 6, = 
cov (y, X) = E(y-y)(X-X) 
= Eiy- y){((x-x)+0-y)} 
= E(x — 3)-F)+ EOF) 


; 2 
- ox eens) ae 
r(x, X) = 5 ee oD 
0; + 5,5, o; + oy 


CORRELATION CO-EFFICIENT AND LINEAI 
en a 


and 


x 
show that var (X ) 
Deduce that r 
Sol. x 
E(x) 

var (X ) 

Since var (x ) 

1+ 2r 

> r2 


rod) = 


¢ 


oF 


X,Y) = P(x, 
Ex. 13-19. x), x2, x3 are three varia 
efficient between any two of them is r. If 


X, +. 


SF wlevolewle 


Nila ©& 


Ex. 13.20. x and y are independent r. 
1. Find ‘a’ so that the correlation co-effi 


Sol. 


Put 
Then 


By given 
x 


cov (x, y) = 


x 


xX 
Cov (X,Y) 


var (X) 
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») PO1G2 


iven by 


1ean zero and variance unity and ry 
2 uncorrelated. 

+ by 

r+by=0 


1) + BE(y’) 


riates, show that 
tion between x + yandx-y. 

(1) 
cov (x, y) 


cov (x, y) 


{(x-x)-(y-y)} 
3 
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COE CIENT SND LINEAR REGRESSION <—- c ce = D 
F Co y 
om eae aR: 
Oy +O 
y 
(X,Y) = 7° X)-?(,X). 


Ex. 13-19. x}, x2, x3 are three variables each with variance 0° and the correlation co- 
efficient between any two of them is r. If 


= Xi +X2 +X3 


show that var (x) 
Deduce that r 
Sol. x 

E(x) 

var (x) 

Since var (x ) 
1+ 2r 

> r2 


3 > 
cea 
(1 +2/) 
1 


2 
Xy +X +X3 


3 

x] +X +X3 
3 

E{%— E(x)}? 


E {1 —%1)+ (x2 — XQ) +(x3 =} 
3 


5 Ella - 3)? +03)? +65 -HP? 


+ 2(x1 — X1) (x2 XQ) +2(x2 — X2) (x3 —X3) 
+ 2(x1 — X1)(x3 —X3)} 


{307 + 2 cov (x1, X2) + 2 cov (x2, x3) + 2 cov (xj, x3)} 
{30° + 6077} ("cov (x, x) = 0 rete.) 


o (1 +2r) 


NIH SS wile wolewoleH 


Ex. 13.20. x and y are independent random variables each with mean zero and variance 
1. Find ‘a’ so that the correlation co- efficient between x + ay and x + y is maximum. 


Sol. 


Put 
Then 


By given 


x 


cov (x, y) 
X 


x 
Cov (X, Y) 


var (X) 


0=y, O& = o,= 1 

0 

xtay,Y=xty 

o=Y 

E(xY) 

EX@ + ay) (x+y)} 

E(x) + (1 + a) EGy) + aB(y? ) 
lt+a 

var(x) + a var(y) 


CORRELATION CO-EFFICIENT AND LINE, 
oe UDMA sis oO 


; Ex. 13-22. fx and y are random \ 
= a show that 
var (Y) = var(x) + var(y) Payer 
= 2 v = ysina- 
2 BRE: are uncorrelated if 
rxy = 
= ive 1+a? 2y5, Sy 
tan20 = ——3 
Now maximum value of rxy Oy — 94 
l+a 1 Sol. By given 
- u = xsinat+ 
{20 +a”) irene 

2 2 v = ysina—- 
ea +2a = 2+2a a@ = Xsina+t. 

ié., a’-2a+1 = 0 i = Vein 
=> a=. ee cpu. 7 
Ex. 13- 21. Ifx and y are uncorrelated random variates with means zero and variances cov(u, v) = E{(u—u 
6,2 and oy respectively. Show that the correlation co-efficient between = E{(x-Xx 
u = xsnatycosa = E(—cos¢ 
and v = xcosa—ysina +(x) 
o? = 03 = sin 0 cos 
1S 7 1/2 = sin @ cos 

{(c? - 33) +4023 cosec? 2 ah Now uw and v are uncorrelated if 
cov (u, v) = 


Sol.. By given => sin & cos a (6,’ — 6,") ~ 46, 


x = 0=y, 2yo,,0 
; var (x) = O71, var(y) = 67" aa tan 20, = -" 
and cov (x, y) = 0 mae 
ines w = xsino+y cos ©. Ex. 13-23. i ila va 
ia x cos O1—y sin 0. (i (5 - »} _ Pq 
u = Q0=v n n 
6, = var(u) = sin? O var (x) + cos” o var (y) x n-x Pq 
= sin’ 0.1" + cos 62 (ii) Co (z. ji )-- 42 q 


6, = var(v) = cos" Gvat (x) + sin’ o var (y) 
= cos. a0" + sin’ ear 
6,0, = (sin’ ac, jf c0s" a0") (cos" ao, + sin? 0.02’) 
= sin’ 0. cos” {1° +6," 7200 on} 
+ oi 26,’ {sin ot + cos" ot + 2 sin’ a. cos” 0} 
= sin’ a cos” ao" -0/) +O; 26,7 


xX = np ar 


x 


ty 
cr o™ 
= [& 
| 
v 
NS 
Nd 
i] 
ty 
, 
| 


Cov (u, v) = E(uy) cane pe 
= ~7 & 
= GEG sme ty C08.) 008 Os = emo); n 
= EG - y’) sin & cos & + xy(cos” 0. — sin’ a)} 1 
= (or >) sn a.608 0+ c0¥ (9) {cos” a. — sin’ a} © 7 we 
= = (6) — 6,”) sin & cos 
rer ‘ a 
oO] — 67) cosasina = = 
Tw = 2 2 P 
- 2 2 2 2 22 , 
sin“ Cos a(o4 - 03) +6} 0} (ii) Let “u= ba y: 
n 
2 2 
(of - 93) Th = a 
eee ees Maser: ee ae es 
7 1/2 - en us = Ve 


- 9\2 
{(c? - 33) +40? 03 cosec” 2a} 
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THEMATICAL STATISTICS — 
ie i Ex. 13-22. Ifx and y are random variates with correlation co-efficient y between them, 


show that 
u = xsina+ycos a 
v = ysina—xcosa 
are uncorrelated if 


20, Oy 
tan20 = —3——Z. 
Oy — OF 


Sol. By given 

= xsinat+ycosa 
= ysina-—xcosa 
= Xsina+ycosa 


<1 Fle s 


= ysina~Xcosa 

cov(u, v) = E{(u—u)(v—v)} 

= E{(x— x)sina+(y—y)cosa} {(y— ¥) sina. —(x —*) cos a} 
= E(-cos sin o(x— x)? + sin & cos a(y— y)? 


variates with means zero and variances 
co-efficient between 


: +(x- X)(y- 3)(sin? a—cos? «)} 
= sin & cos a(o,7— Ox OF cov (x, y) cos 2a 
= sin a cos a (6,7 — 0,7) - YO,G, cos 20 
Now u and v are reas if 
cov (u, v) = 
=> sinacosa (s” - 3) — YO.0, cos 2a = 0 
=> tan 2a = uy 
2 Oy —O% 
Ex. 13-23. Let x be a binomial variate with parameters n and p. Show that 
a x 2 Pq 
a (ij) £E (z 7 P} cia 
n n 
var (x) + cos” o var (y) (ii) Cov (z n*) = - 24, q=1-p 
8” 0 2 non 
t var (x) + sin’ o var (y) Sol. (i) Since x is B.V. with parameters n, p 
in’ wae x = np and var(x)=npq 


Ms” {0102 *) (cos" a1, + sin’ 0.02’) 
a1" +6! 200 or} 

‘a. + cos" ot + 2 sin’ o. cos” 0} 
3? - 02°) + 0102" 


-r) - (2) 


_! 2 
iat) E(x — np) 
cos 01) (x cos O— sin 0)} 


La cos a + xy(cos” ot — sin’ a)} 3 nape var (x) 
La cos 0 + cov (x, y) {cos” o — sin Po} 7 n? 
1.acos @ _ npq = PQ 
ps 03) cosa sina n? n 
1/2 
2 “s x n 

2s 3) ot a3 (ii) Let u=-, v= 

2 (of G2) +91 92 n- n 
+ ae 3) _ X . n-x 

(63 o2 Then u=z-,v= 
ere n n 


; V2 
‘ +40? 03 cosec” 2a\ 
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x n-Xx 
cov | —, 
non 


cov (u, v) 


E{(u-i)(v-9)} 


ll 
_—_ 
r~ 
=e 
l 
x [et 
a, 
, 
= 
=i 
Pay 
I 
= 
=] 
a 
— 
——— 


= 
7 -2(? *) 2 BGS) 
n n 
= Oe 
: ©) oe 


13.3. Rank Correlation 


13.3-1. Non-Repeated Ranks ' 
Let n be the number of individuals which are ranked according to two different characters 3 


repeated in either series, both x and y take the same values 1, 2, .... n. 


+1 
Then Xx = By 42+... +n= ee ) 
and Be = Sart Ps.tn= n(n+1)(2n+1) 
ue 


i= a ey 
var (x) = var (y)= "ax sl 


_ (nt))(Q2ntl) _ {aeeoy" n=] 
id 6 2n 12 
Let d=x~y : 
Xd? = Ex-yy = E(x- ¥)-(y-H)}? (oF =y), 
where x and y are A.Ms. oy 
1 i 


Ly ee L(x — x)" = X(y-y)* -24+5(x-7)(y-9) 
n n n a 
= var (x) + var (y) — 2 cov (x, y) 


ee ee 


= + =2 ; 
7) 2 cov (x, y) 
2 3 
n*-1 1 3 
cov (x, = —-— kd 
OD" yo 
.. Correlation co-efficient between x and y is given by 
cov(x,y) 6 2 
r ——4 = — 
(s.d.of x) (s.d.of y) n(n? -1) 


‘r’ is called Spearman’s rank correlation co-efficient. 
Ex. 13-24. Calculate Spearman ’s rank correlation co-efficient from the following data 
Two numbers within brackets denote the ranks of the students in papers A and B respectivel) 
(Z, 1); (2, 10); (3, 3); (4. 43 (5, 3 (6, 7); (7, 2)3 (8, 6); (9, 8); U0, 1); U1, 15); U2, 9 
(13, 14); (14, 12); (15, 16); (16, 13). 


CORRELATION CO-EFFICIENT AND LID 
oon 


Sol. Let R; and R, be the ranks 


Calculation « 


* 
nN 


1 1 
2 10 
3 3 
4 4 
5 5 
6 7 
9 2 
8 6 


.. Spearman’s Rank Correlatio 


J-- 


~ 
ll 


= 0:8. 
Ex. 13-25. Ten competitors in « 
data : 


First Judge : 1 6 
Second Judge: 3 5 
Third Judge: 6 4 


Use the method of rank correk 
approach to common likings in voic 
Sol. Let Rj, Rp, and R3 be the r. 


Calculation ¢ 


_ 
NSAOWNHK DORAN 


SCoOMINDNAWNH 
CIO RNWOUND= 
— 

CAH NONROW 


— 


.. Rank co-eff. of correlation b 


Rank Co-eff. of correlation bet 


ae Pee: 


=) 


ked according to two different characters 4 
spectively. Assuming that ranks are not - 


e values 1, 2, .... n. 
n(n+1) 

= 9 

= n(n+1)(2n+1) 


6 
= 
ssn) 2 n-l 


; 


2n 12 


x)-(y-y)y? (r= 


(7-9)? -2=2(¢- HJ) 
2 cov (x, y) 


2 cov (x, y) 


given by 
6 2 
~=1-—>;— 
y) n(n° ~1) 
nt. 


ition co-efficient from the following datta 
e students in papers A and B respectivel™ 


); (8, 8) (9, 8); (10, 11); (1, 15); 12.9 
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MS 


Sol. Let R; and R; be the ranks for A and B respectively. 


Calculation of Co-eff. of Rank Correlation 


1 1 
10 1 
3 16 
4 9 
5 1: 
7 4 
2 1 
6 9 


.. Spearman’s Rank Correlation Co-efficient is given by 


6xd? 6.(136) 
Pee SS 
n(n? 1) 16.255 
= 08. 
Ex. 13-25. Ten competitors in a voice test are ranked by three judges in the following 
data : 


First Judge : 1 6 5 10 3 2 4 9 7 8 
Second Judge: 3 5 8 4 7 10 2 1 6 9 
Third Judge: 6 64 9 8 1 2 3 10 § 7 


Use the method of rank correlation to gauge which pair of judges have the nearest 
approach to common likings in voice. 


Sol. Let Ri, Ro, and R; be the ranks due to three judges respectively. . 
Calculation of Rank Co-eff. of Correlation 


ONDKNO~TIH CNW 
ent 
SION OWNF COU RA 


OTOP NWOWNDA & 
— 


1 
2 
3 
4 
5 
6 
7 
8 
9 
0 


— 


.. Rank co-eff. of correlation between first and second judge 


10.99 


Rank Co-eff. of correlation between first and third judge 


= 1- OO _ 0.636 
10.99 
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and rank co-efficient of correlation between second and third judge 


= 12828 220:297, 
10.99 


Since the correlation between first and second judges is —ve, opinions regarding voice 


test are opposite of each other. Similarly the opinions of second and third judge are opposite 
of each other. But the opinions of first judge and third are of similar type as their correlation 
is positive i.e., their likings and dislikings are very much common. 

Hence first and third judges have nearest approach to the common likings. 


13.3-2. Repeated Ranks 


In this case two or more individuals are bracketed equal in either or both classifications. 
Here, common ranks are given to the bracketed individuals. This common ranks is the 
average of the ranks which these individuals would have assumed had they been slightly 
different in ranks from each other. 

The ranks co-eff. of correlation when 4), f .... tp figures are given same ranks, is given by 


_ 6(Zd? +T) 
n(n? -1) 


where T 


ll 
Ms 
— 
S| 
— 
ne 
I 
ans 
— 


Ex. 13-26. Find spearmans rank correlation co-efficient for the data given below : 
Students : 1 2 3 4 5 6.7 8 9 10 11 12 
MarksinExam.A : 15 13 17 14 18 12 20 16 18 17 =«©19 21 
Marks in Exam.B : 18 16 18 15 19 16 18 15 21 17° 18 20 

Sol. 


Calculation of Rank Co.eff. Correlation 


Here in paper A, two students have got 18 marks each and two 17 marks each. While 
marking ranks, ranks 1, 2, 3 are given to students getting marks 21, 20, 19, ranks 4th and 5th 
are to be given to students getting 18 each. As they have got equal marks, their ranks should 
be same and hence each is given the rank. 

44+5 
— = 45 


2 
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EE 


Similarly the ranks of students ; 
ot 


2 
In paper B, there are four stude 
them are 4, 5, 6, 7. 
: Rank of each student get 
4+ 


= 6:5: 


Similarly ranks of each student 

9+ 

2 

and rank of each student getting 15 
114 


e 
a 


As in a paper 4, two students g 
6-5, for paper A we have t) = 2, f= 

In paper B, there are 4 students 
students getting rank 11-5. 


= 4,4 
Salt 
~ 42 

1 

= —{ 

3 

= 2+ 

r=1- 
= 1- 


Ex. 13-27. The co-efficients of 
Physics and Maths. was found to | 
-ranks for one student was wrongly 
rank correlation. 

Sol. Here r= 0-4, n= 10 


2 
Byala = 94 
n(n* -1) 
63 
9-6 se So. 
= 10 
=> rd = 99. 


Now, corrected value of Ld’ = 
Corrected value of rank correl 


= 1- 
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i third judge 


zes is —ve, opinions regarding voice 
‘second and third judge are opposite 
re of similar type as their correlation 
ch common. 

1 to the common likings. 


qual in either or both classifications. 
viduals. This common ranks is the 
ave assumed had they been slightly 


res are given same ranks, is given by 


efficient for the data given below : 
7 8&8 9 10 11 12 
20 16 #18 417 «+19 21 
18 15 21 17 18 20 


Correlation 


s each and two 17 marks each. While i 
ng marks 21, 20, 19, ranks 4th and 5th : 
ve got equal marks, their ranks should } 
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Similarly the ranks of students getting 17 marks each is 
6+7 
2 
In paper B, there are four students getting marks 18 each and the ranks to be given to 
them are 4, 5, 6, 7. 
: Rank of each student getting marks 18 
445+6+7 22 _ 
4 4 
Similarly ranks of each student getting 16 marks in B 
94+10 _ 5 
2 
and rank of each student getting 15 marks 
11412 
2 
As ina paper A, two students get the same rank 4-5 and two students get the same rank 
6:5, for paper A we have t, = 2, t) =2. 
In paper B, there are 4 students getting rank 5-5, two.students getting rank 9-5 and two 
students getting rank 11-5. 
ca tz = 4,t4=2, ts=2. 


= 6-5 each 


5:5. 


=11-5 


1 3 1 3 1 3 i 3 
= — {27-2} + — {2° -2}4— {4° -4}+— = 
T= 5-3 + 2-24 WA (2?-2} 


1 3 
4 is 
D {2° — 2} 
- +9-34+4 (64-4 
3 12 =i 
= 24+5=7 
on AT2S#D 
(12)(143) 
Fag IO DOD a Bio 
286 286 


Ex. 13-27. The co-efficients of rank correlation of the marks obtained by10 students in 
Physics and Maths. was found to be 0-4. It was later on discovered that the difference in 
ranks for one student was wrongly taken as 2 instead of 3. Find the correct co-efficient of 
rank correlation. 

Sol. Here r=0-4, n=10 


2 
ie = 0-4 
n(n“ -1) 
2 
ex Rei 6xd 
10(99) 
= Ld = 99. 


Now, corrected value of Ld’ = 99 -44+9=104. 
Corrected value of rank correlation co-efficient 
4 6(104) 


= = 0-37. 
10-99 
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13.3.3. Limits for the Ranks Correlation Co-efficient 
' The formula for rank correlation co-efficient is 


6.2d? 
n(n? -1) 


ao 
where r is the rank correlation co-efficient. Now, r is maximum when Ea” is minimum, 
which is so only when each d is minimum i.e., zero. This is achieved only when ranks of 
each individual are same in either classification. 

Minimum value of Xd” = 0 

Maximum value of r=1 
ris minimum, when La” is maximum. This is achieved only when the ranks in two classifications 
are in reverse order i.e., if the rank of an individuals in one classification is r, its rank in other 


classification is n — r—-1 =n + 1-1. In this case corresponding value of d is 
In - (2r - 1)| 
Maximum value of Xd ? 


=} @-@r-pP 

= 3 {n? + (47? — 4r + 1) -2n (2r—-1)} 
= {n+ 1 -4(n + 1) r+ 47} 

= n(nt+1)P-4(n+1) x ‘ay Pr 


r=] r=l1 


= n(n +1P-4(n +1): mee gets) 


= — n(n + 1 + = n(n +1) Qn +1) 


aye n(n + 1) {4n+2-3 (n+ 1)} 


3 
1 
= AGN) 
6 re 
Min. value of pom lee ee a ST) 
n(n*-1) 3 
= -1, 


13.4. Regression and Lines of Regression 


In case there is some relationship between the variates, the points of the scater diagram 
will be more or less concentrated round a curve. This curve is called curve of regression. 
From this curve, it is possible to estimate one of the variables (the dependent variable) from 
the other (the independent variable). This process of estimation is often referred to as, 
regression. If y (or x) is estimated from x(or y), regression curve is of yon x (or x ony). 

Incase this curve is a straight line, it is called the line of regression and the regression 
is said to be linear. 

Evidently the line of regression is the straight line which gives the ‘best fit in the least 
square sense’ to the given data. 

In case y is treated as dependent and x independent variable, the line of regression is 
called the ‘line of regression of y on x’ and gives the best estimate of y for any given value of 


~ CORRELATION CO-EFFICIENT AND 


x. Inthe contrary case it is called 

of x for any given value of y. 

13.4-1. Equations of Lines of I 
Consider the bivariate freq. 


xa 


yo 


where fj + f2 + ..... fn = N. 

The line of regression is the : 
distribution. 

Let y = mx + c be the equ 
unknowns to be determined by t 

Let Y; = 4 


and. S= , 

i 

According to the method o: 
minimum. 

The normal equations are : 


n n 

2 . 

ie., m >. fix +e>> fi: 
i=l i=l 


Now by def, 


and Wi =) 
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maximum when Ld is minimum, 
iis is achieved only when ranks of 


when the ranks in two classifications 
e classification is r, its rank in other 


mding value of d is 


1—2n (2r—-1)} 
ir+4r} 
r+4), r 
r=l 
ntl), n(nt)Qn+)) 
2 6 
) 2n+ 1) 
(n+ 1)} 
-1) 


tes, the points of the scater diagram 
curve is called curve of regression. 
ables (the dependent variable) from 
‘estimation is often referred to as, 
ion curve is of y on x (or x ony). 

ine of regression and the regression 


which gives the ‘best fit in the least - q 


at variable, the line of regression is 


t estimate of y for any given value of | 
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x. In the contrary case it is called the ‘line of regression of x on y’ and gives the best estimate 
of x for any given value of y. 


13.4-1. Equations of Lines of Regression 
Consider the bivariate freq. dist. 


XD XY XQ wceeeeee Xn 
yr | VQ vveveves Yn 
le 6 ee hr 


where fj +f. + ..... fn = N. 

The line of regression is the straight line best fitted in the least square sense to the given 
distribution. 

Let y = mx + c be the equation of line of regression of y on x where m and c are 
unknowns to be determined by the method of least squares. 

Let Y¥; = mx;+c 


n n 
2 
and s= VAG) => fom; +e-y)? 
i=] i=l 
According to the method of least squares, m and c are to be determined so that S is 
minimum. 
The normal equations are : 


P os n 
== 2 aa eit ee -y;) 
i=] 
n n n 
i.e., m> fimited f= d ivi 
i=l i=l i=] : 
ie., mx +c=y (1) 
os w 
and 0= oy => 2 fi (mx; +c—y;)(x;) 
m0 = 
ie., m 3 fix} +e, Sixi = FiXi¥i (2) 
i=l i=l : 
Now by ae 
F 1 n =) 1 n 
62-= a > Fi (4%; -X) sae f(x? +X 2 _ 2x, x} 
; f=] i=l 
n 


ree 7 2 
and Ly = cov (x, yaa YX fi -*) 0, -Y) 
; ‘ isi 
eee Fi iY; = ry Xj¥+x.y} 
N A I i 
| ee yee 
= WN yy Si XiY; x.y 
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n 
DS SiXi¥i = N(uirt ¥.¥) 
i=] 


Substituting in (2) 
moe + ¥?)+¥e = py + ¥.¥ (3) 
Solving (1) and (3) for m and c 
m= Pu and c=y-Ph 
ox 


Eq. of line of regression of y on x is 
= uy a 
yoy = “(= %) 
Similarly the line of regression of x on y can be shown to have its equation 


ee Moe y). 


Ex. 13-28. [fa is the angle heen the two regression lines in the case of two variables 


x and y show that 
2 
l-r 0,0 
tana = +5 
r o¢+o 


y 


where the symbols have their usual meanings. 
Sol. The equation of two lines of regression are 


yoy = oF -¥) ...(1) (y on x) 


x 


x-# = Vo-7) — ..Q ony) 
Case I. If py) # 0. 4 
o 
Slope of (1) = Au =r 
co x 
2 
o o 
Slope of (2) = Sa 
Hy ro, 
where r is the correlation co-efficient between x and y 
1S, roy 
~ ¢ --? oo 
Now anes ee os 
en oy ro OX +04 
oF 


Case II. If 4, = 0 
Lines (1) and (2) become 
y=y and x=<Xx 
which are parallel to as ees axes 
= 90°. 


13.4-2. ee Co-efficients 


The quantities —>- En and Pu 


ae} 
respectively and are ie by. b,x and byy respectively. 


are called regression co-efficients of ‘y on x’ and ‘x ony’ 


CORRELATION CO-EFFICIENT A 


13.4-3. Properties of Regre 
(i) The correlation co-efj 
Regression co-efficients « 


Dye * 


By. Day * 


where r is the correlation co-' 
(ii) The correlation co-efj 

regression co-efficients. 
Regression co-efficients 


| Dy t Oxy 
2 
Remarks. (1) 5,, is the s 
the slope of line of regression 
(2) byx, by and r are of : 
Ex. 13-29. The ages (x 
below: 


Ns 


_ Age in ye 
(x) 


Determine the least squar 
co-efficient of Y on X. 
Also estimate the blood 5 
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(3) 


to have its equation 


‘ines in the case of two variables 


nx) 


ny) 


yO, 


2 2 
+O 


-efficients of ‘yonx’ and ‘xony’ 
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13.4-3. Properties of Regression Co-efficients 


(i) The correlation co-efficient is the geometric mean between regression co-efficients. 
Regression co-efficients are given by 
— Hu — Hi 
byx = —> and by, = ) 
x Oy 


2 
bye. Dry = [te] =? 


O, Oy 
where r is the correlation co-efficient. 
(ii) The correlation co-efficient cannot numerically exceed the arithmetic mean between 
regression co-efficients. 
Regression co-efficients are given by 


by, = ELL r— and by = ELL = poe 
1] x Oy Oy 
2,72 
by, + By, ay OF +0; 
2 20, Oy 
byx + Byy a +04 
; r= be 
20, Oy 
2 
o,-o 
We x) a 
Ox, Oy 
by, +b 
; -S Lae | 


Remarks. (1) b,, is the slope of line of regression of y on x and b,, is the reciprocal of 
the slope of line of regression of x on y. 

(2) byx, by and r are of same signs. 

Ex. 13-29. The ages (X) and systolic blood pressures (Y) of 12 women are given 
below : 


Age in years Blood Pressure 
(*) (Y) 
56 147 
42 125 
72 160 
36 118 . 
63 149 
47 128 
55 150 
49 145 
38 115 
42 140 
68 152 
60 155 


Determine the least squares regression line of Y on X and find the value of the regression 
co-efficient of Y on X. 
Also estimate the blood pressure of a woman whose age is 45 years. 
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Find (i) the co-efficient of cor 
(ii) the two regression lint 
Sol. The calculating table is o1 


(i) .. The co-efficient of cor 
= 
le 
2 aee8 
~ (122 
_ 9% 
121: 
ea (ii) u= y= 
e 2 = 2 = 
Let the line of regression of y on x be and Ou co 
y = artbx 1 
where the co-efficients ‘a’ and ‘b’ are given by the equations = (lot 
Ly = na+by<x 7 hk 
and : Xxy = adx+ bir’ Puy 7 ie lit j a 
Substituting the values of Lx etc., The equations o ae 0 
4 = 12a+4b or 1 = 3at+b (v+0-08) = 0-80 
and 1766 = 4a+ 15525 or 883 = 2a+776b 


and (u+0-08) = 0-80 


Solving ; The equation of lines of r 
a = -0-:046 and b=1-138 . fy-40 
The equation of line of regression of y on x is ( 10 aah 08) =0'80 
y = € 0-046) + (1-138)x Se y = 0:80 
The equation of line of regression of Y on X is 
Y—140 = — 0-046 + (1-138) (X¥- 52) and Ge + 0-08) = (0-81 
or Y = (1-138)X + (80-778) 10 
‘ Regression co-efficient of Y on X = (1-138) or x = 0:80 


Now value of x for X = 45 is (45 — 52) =-7. Ex. 13-31. In a partially desti 


Estimate of y = —0-046- 7-966 data, the following results only are i 
= ~8-012 and 40x — 18y = 214. 
Estimate of Y for X = 45 Supply (i) mean values oJ 
= 140-8-012 (ii) the s.d. of y. 
= 131-988 (iii) the correlation 
Ex. 13-30. For the following table : Sol. (i) The equations of line: 
Age of Ages of wives in years y-y = by | 
husbands) —_—__— $$ —_________ ana Pe aoe ee 
in years 10 — 20 20—30 30—40 40 — 50 50-—60 _— Total , * 


15 — 25 6 : 3 = pan ee 9 Thus x and y are the valu 

25 — 35 3 16 10 — a 29 simultaneously. 

35 — 45 —_ 10 15 7 a 32 Solving regression equati 

45 — 55 — — if 10 4 21 xX =13 and y=17 

55 — 65 — oo — 4 5 9 (ii) Itis not given, of the two reg 
OT 

Total -) 29 32 21 9 100 ~ of y on x. So we assume, 

pa a a gy =. OOS 
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tions 


1 = 3a+b 
883 = 2a+776b 
8 
s 
2) 
ars 


40 — 50 50—60 Total 


= — 9 
a — 29 
7 —_— 32 
10 4 21. 
4 5 9 


21 9 100 


a i 


CORRELATION CO-EFFICIENT AND LINEAR REGRESSION 531 


Find (i) the co-efficient of correlation, 
(ii) the two regression lines. 
Sol. The calculating table is on page 532 
(i) _ .”. The co-efficient of correlation is given by 


1 
98- 100 (-8)(—8) 


a Se 
| 1 2 1 2 

122 -——(-8 122 -_—_(-8 

00‘ ) 100‘ ) 


_ _9800-64 
~ (12200~64) 
12136 
% ~ _ (-8) 
k= = — =). 
(i) Ve Tao oe 
2 
1 8 
Dring Bie ES sss ie 
and Ou oO, 100 {122} ( = 
1 
= x {12200 - 64} = 1-2136 
(100) 
by = by, = r=0-802. 
The equations of lines of regression of v on u and u on vy respectively are 
(v+ 0-08) = 0-802 (u + 0-08) 
and (u+0-08) = 0-802 (v + 0-08) 
: The equation of lines of regression of y on x and x on y respectively are 
y-40 } Gre 
+ 0-08} = 0-802 + 0-08 
( 10 os 10 
or y = 0-802 x+ 11-772 
x-35 y-40 
+ 0-08] = (0- + 0-08 
and ( 10 } (0-802) ( 10 
or x = 0-802 y+ 2-762. 


Ex. 13-31. Jn a partially destroyed laboratory record of the correlation analysis of 
data, the following results only are legible : var {x) = 9, regression lines 8x — 10y + 66 = 0 
and 40x — 18y = 214. 

Supply (i) mean values of x and y. 

(ii) the s.d. of y. 
(iii) _ the correlation co-efficient between x and y. 
Sol. (i) The equations of lines of regression of y on x and x on y respectively are 


yr-y = Dyy (x-x) (1) 
and x-x = by(y-y) (2) 
Thus x and y are the values of x and y which satisfy both the regression equations 


simultaneously. 
Solving regression equations 
xX =13 and y=17 
(ii) Itis not given, of the two regression equations which represents the line of regression 
of y on x. So we assume, 


532 


Let x be the variable for the ages of wives and y be the variable for the ages of husbands. 


an 
N 
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foe) 


— 
N 


_ 
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&8x—10y+66 = 0 
to be the equation representating th 
the line of regression of x on y mu: 

40x -18y = 21 
Comparing with (1) and 


4 
by = 5 

The co-eff. of correlatio; 
4 

1° = by. Dry = 7 


Since 7? comes out to be less t 
Since by, and by, are positive, j 


3 
rz=nore 
5 
(iii) Now b,, = poz 
Oo, 
4.3 
5. S. 
or oO, = 4. 


Ex. 13.32. Two random variabl 
and 6x + y—31=0. Find the mean 
Sol. Solving regression equatic 


x= 4 ; 
Now slopes of regression lines 
a 
2 
Since rs 1, 
3 
by, = — 
‘ 2 
r= dy. 
r= —0-5 


Ex. 13.33. For a bivariate dist) 
3y + 9x = 46, Find the mean of the d 
Sol. Solving regression equatic 


x=Sa 
Now slopes of regression lines a 


Bi Sie 


r 


i] 
= 
Rad 


sae 
av 
Ex. 13-34. Given that the lines « 

* and 4x —y~3=0 and the second mc 
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8x — 10y +66 = 0 
to be the equation representating the line of re 
the line of regression of x on y must be 
40x - 18y = 214 
Comparing with (1) and (2) 


gression of y on x. Then equation representing 


4 
by. = 3 and by = 


aj 
No 
a) 


The co-eff. of correlation r is given by 
ye ees (<1) 
ess 5 20 25 ; 
Since r* comes out to be less than unity, our aSsumption is correct. 
Since b,, and b,, are positive, r must be positive and hence 
3 


= - =0-6 
a. 
oO 
(iit) Now by,= r— 
Oy 
4 _ 3 Y 
5 5.3 
or 0, = 4. 


Ex. 13.32. Two random variables have the least squares regression line 3x + 2y—-26=0 
and 6x + y— 31 = 0. Find the mean values and the correlation co-efficient. 
Sol. Solving regression equations 
x = 4 and y=7 
Now slopes of regression lines are 


- 2 and -—6 

2 
Since r< 1, 

1 

byx +~— and by == 

1 
r= bye by = 7 (E1) 
r= —0°5. (. by, by are < 0) 


Ex. 13.33. For a bivariate distribution, the lines of regression are 3x + 12y = 19 and 
3y + 9x = 46. Find the mean of the distribution and the correlation co-efficient. 
Sol. Solving regression equations 


x = 5 and y= 
1 
Now slopes of regression lines are — rr and -—3 


1 1 

bye =~ and By=— 3 

P = by by == 

r>=---—. (. yx, by <0) 
2V3 J: cy 


Ex. 13-34. Given that the lines of regression of y on x and x on y are respectively y = 
*and 4x— y—3=0 and the second moment about the origin for x is 2; calculate (i) the mean 
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for x (ii) the mean for y (iii) variance of x (iv) variance of y (v) the correlation co-efficient 
between x and y. 
Sok Solving regression Ces 


x = 1=y 
Now slopes of regression ee are 1 and 4 
1 
byx = 1, by = 4 


1 
y= a ie, r=0-5 
Now for x, 1’, (0) =2 
Oy = w’2(0)- x2 =2-1=1. 


ry 1 
Also 1=b, = a: = 5 & 
oO, = 2 


Be y : 
Ex. 13.35. Given x = 4y+ 5 andy =kx + 4 are the regression lines of x on y and y on 


1 
x respectively. Show that 0<4k<1.Ifk= 16? find the means of the two variables and the 


co-efficient of correlation between them. 
Sol. Here b,, =k and b,=4 


P = by. by = 
Now since r* is the square of real ae it should be nonegative. 
Since bxy # 0, r#0 
also as two lines of regression are different, rP#l. 
ns 0<Pr<!1 
=> 0 < 4k<1 
1 
When k = 16 the equations of lines of regression become . 
x = A4yt5 (x on y) 
l6y = x+64 © (y on x) 


Sebane regression equations 
xX = 28 and y=5-75 


| 
> 
= 
ll 
zs 
| 
I 


Also r= 


n= = 0-5. : (yx, by > 0) 


Ex. 13-36. The lines of regression obtained in a correlation analysis are 


1 
x+9y = 7 and yt4x= 167 
Findthe (i) the co-effi iclont mf correlation 


(ii) the ratio oY: : cov (x, y). 
Sol. Slopes of regression ie are 
- - and —4 
9 

Since rs, ] 
1 1 1 
bd, il =— eel 1) 4q 
yx 9’ by 4 ( : 
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Se ee 


CS bie 


“ 
I 
— 


Se 
6 
Also by = a 
9’ 
: 1 
Ms Pus _ = 
Ot 9 
> Wi: 6 
-1:9 
Ex. 13-37. For two variables x 
xt+2y-5 = 0 
and 2x+3y-8 = 0 
Also var (x) = 12 


Find x,y, oy and r. 
Sol. Solving regression equatio 
xX = 1 anc 


Slopes of regression lines are — - 


ba 


r+e- 
Also b ee and o, = 2, 
x 
(4) -(4 
2) > 2 
Oy = 2. 


Ex. 13-38. For 10 aber 
obtained (in appropriate units) : 
zx = 130, } 
Obtain the line of regression of 
units. 


Sol. Let the equation of the line c 


y = at bx, 

where the co-efficients ‘a’ and ‘b’ are 

Zy = natb, 

and oxy = aXxt. 
Substituting the values 

220 = 10a+: 


and 3467 = 130a+ 
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of y (v) the correlation co-efficient 


e regression lines of x on y and y on 


> means of the two variables and the 


id be nonegative. 


1 become 


(byx» ry > 9) 


correlation analysis are 
1 


3 


(1 
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? = by. by = 
DAG 
1 fe 
r= - 36 C. By, by <0) 
eee 
6 
1 Hy 
Also byx = ~ 9? by a oe 
Hai 1 Hy 1 
. Is = = --, aed 0 RE ec 
( o2 9 oF 4 
=> Hi; : O Oy 
: -1:9 : 4, 
Ex. 13-37. For two variables x and y the two regression lines are 
x+2y-5 = 0 
and 2x+3y-8 = 0 
Also var (x) = 12 


Find x,y, Cy and r. 
Sol. Solving regression equations 
x = 1 and p=2. 


Slopes of regression lines are — ; and — ; 


N fw 


ee | 
by = ~ 5 and by = - 


CIB fea 
v3 


oe C. 


, Byx, Dy <0) 


o 
Also b,, = r— and 0, = 2J3 


( +). [- NE Eon: 
an ee ee 

dis Oy = 2. 

Ex. 13-38. For 10 observations on price (x) and supply (y) the following data were 
obtained (in appropriate units) : 

Xx = 130, Ly =220, Ex’ = 2288, dy? = 5506, Xxy = 3467. 

Obtain the line of regression of y on x and estimate the supply when the price is 16 
units. 

Sol. Let the equation of the line of regression of y on x be 


y= atbx, 
where the co-efficients ‘a’ and ‘b’ are given by the normal equations 
Ly = na+ b&x 
and Ixy = aXx+ bry? 
Substituting the values 
220 = 10a + 130b or 22=a+ 135 


ll 


and , 3467 130a + 22885 
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a = 88 and b=1-015 
The equation of line of regression of y on x is 
y = 88+ 1-015x 
Estimate of supply (y) when the price (x) is 16 units 
= 8-8 + 16-240 = 25-04. 
Ex. 13-39.. From the data given below estimate the most likely height of a father whose 
son’s height is 70”. 
Fathers : Mean height is 67” with a sd. of 3:5” 
Sons : Mean height is 65” with a sd. of © 2:5” 
Co-efficient of correlation between the heights of. ‘fathers and sons is + 0:8. 
Sol. Let y be the variable corresponding to the height of fathers and x be the variable 
for the son’s heights. ‘ 
Then x = 65, py =67, 0,=2°5, 6,=3-5, and ry =0°8. 
(0-8) (3-5) 
by (2-5) 1-12 
The equation of line of regression of y on x is 
y-y = byx (x-x) 
or y-67 = 1:12 (x-65) 
or y 1-12x — 5-8 
Most likely height of a father whose son’s height is 70” 
= Estimate of y for x = 70 
= 78-4-5-8=72-6. 
Ex. 13-40. The following statistical co-efficients were deduced in the course of an 
examination of the relationship between yield of wheat and the amount of rainfall. 


Yield in Ibs Annual Rainfall 
(per acre) (in inches) 
Mean 985-0 12-8 
s.d. 70-1 1-6 


r(between yield and rainfall) = + 0-52. 

From the above data, calculate (i) the most likely yield of wheat per acre when the 
annual rainfall is 9-2” and (ii) the probable annual rainfall for yield of 1,400 Ibs. per acre. 

Sol. Let y be the variable for yield and x be the variable for annual rainfall. 

Then ¥ = 12-8, j =985-20, 0; = 1-6, 6,=70-1, and ry = 0-52 


(0-52) (70-1) 


Dyx — 0-6) = 22-7825 
(0-52) (1-6) 
, = + ——_ = 0 7 
and bey ae 0-0118 


The equations of lines of regression are 
y—985 = 22-7825 (x— 12:8) 
and x—12-8 = (0-01187) (y— 985) 
. The most likely yield of wheat per acre when the annual rainfall is 9-2”. 
= 985 + (22-7825) (— 3-6) = 902-983 
= 903 
and the probable annual rainfall for yield of 1,400 Ibs. per'acre 
= 12-8 + (0-01187) (415) 
= 17-72605 = 17-7”. 


(y on x) 


(x ony) - 
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Ex. 13-41. The following dat 
rainfall and yield of paddy in a cert. 


Yield (per ¢ 

in Ibs 

Mean 973°5 
s.d. 38-4 


Co-efficient of correlation = 0. 

Estimate the most likely yield o 
being assumed to remain the same. 

Sol. Let y be the variable for yi 

X = 18-3, jy =973-5, o, 


4 
ae (0-5 


.. The equation of line of regre 

y—973-5 = ILL 

.. Estimate of the most likely y 

= Estir 

= 973:: 

= 973: 

Ex. 13-42. /fa number x is cha 

number y is chosen from among thes 
cov (x,y) = 5/8 

Also find the line of regression c 

Sol. Since x is to be selected at 

1, 2, 3, 


prob. of x taking each of these value: 


Now, when x takes value 1, y is 


.. Conditional prob. of y taking 


.. Prob. of each of the pairs 


(, 1), ¢ 
11.1 
44. 16 


Now, when x takes value 2, y is 


.. Conditional prob. of y taking 
and conditional prob. of y taking eac 


Prob. of the pairs 
(2,1), ¢ 
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Ex. 13-41. The following data give the correlation co-efficient, means and s.d. of 


is rainfall and yield of paddy in a certain tract : 


Yield (per acre) Annual Rainfall 
16 units in lbs (in inches) 
: F hose Mean 973-5 18-3 
, most likely height of a father w 
ESE sd 38-4 20 


with a s.d. of 35” 
with a s.d. Es 2°5 
‘fathers and sons is + 0°8. | 
ie of fathers and x be the variable | 


Co-efficient of correlation = 0-58 


Estimate the most likely yield of paddy when the annual rainfall is 22”, other factors 
being assumed to remain the same. 


Sol. Let y be the variable for yield and x be the variable for annual rainfall. Then 
xX = 18-3, 9 =973-5, 6,=2-0, o,=38-4 and Ny = 0-58. 


pee (0-58) (38-4) 
Dy = = 11-136 
(2-0) 

; .. The equation of line of regression of y on x is 

= y—973-5 = 11-136 (x — 18-3) 
.. Estimate of the most likely yield of paddy when the annual rainfall is 22” 
= Estimate of y for x = 22 

height is 70” = 973-5 + (11-136) (3-7) 

70 


973-5 + 41-2032 = 1014-7032 = 1014-7. 


Ex. 13-42. [fa number x is chosen at random from among the integers 1, 2,3, 4 and 


7 g an 
re deduced in the course of an — | 
nts we number y is chosen from among these at least as large as x, prove that 


eat and the amount of rainfall. 


Aaiital Rainfall CoV (x, y) = 5/8 
(in inches) Also find the line of regression of x on y. 
12:8 Sol. Since x is to be selected at random from the integers 
1-6 


1,2,3,4 


a : 
ikely yield of wheat per acre when the ) prob. of x taking each of these values is 4° 
rainfall for yield of 1,400 Ibs. per acre 
e variable for annual rainfall. 


70-1, and ry = 0-52 


Now, when x takes value 1, y is to be chosen out of 1, 2, 3, 4. 


.. Conditional prob. of y taking each of these values is ~. 


.. Prob. of each of the pairs 


7825 
a (1, 1), (1, 2), (1, 3) (1, 4) is 
1 1 1 

01187 A a 46 

(y on 2 Now, when x takes value 2, y is to be chosen out of 2, 3, 4. 
" (x onyi .. Conditional prob. of y taking value 1 = 0, 

: . ” 1 

hen the annual rainfall is 9-2". and conditional prob. of y taking each of the values 2, 3, 4 = — 
~ 3-6) = 902-983 3 


Prob. of the pairs 
(2, 1), (2, 2), (2, 3), (2, 4) are 
1 1 1 1 1 


) Ibs. per'acre 
(415) 
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when x takes values 3, y is to be chosen out of 3, 4. 


Conditional prob. of y taking each of the values 1, 2 is zero 
and conditional prob. of y taking each of the values 3, 4 is = 


Prob. of the pairs 
(3, 1), (3, 2), (3, 3), (3, 4) 
11 ._=#1421 


are 0,0,--- = -,-. 
2 4 8 8 


When x takes value 4, y can take only one value 4. 
Conditional prob. of y taking each of values 1, 2, 3 is zero and the conditional 
prob. of y taking value 4 is 1. 
Prob. of the pairs 
(4, 1), (4, 2), (4, 3), (4,4) 
1 


1 
0,0,0, —~.1 = = 
are 4 4 


Thus, the bivariate distribution is 


yo 1 2 3 4 
xt 
I Et ze fie 
' 16 16 16 16 
a a 
: 12 12 12 
1 
3 0 0 8 g 
4 0 0 0 4 
The calculating table is on next page 
@) cov (x, y) = Zpxy— (Zpx) (Zpy) 
_ 42010 156 
~ 48 4° 48 
ee 
a: 
(ii) Let eq. of line of regression of x on y is 
x = at+by 


Normal equations are 
Zpx = a+ b&py 


and ‘Zpxy = axpy+ bEpy’ 


Substituting values, equations reduce to 


arc 

4 a+b 48 

~ 430 156 $48 
a+b 


and ae ag Pag: 


CORRELATION CO-EFFICIENT AND L 


Ba: of line of a ss of 
= 01 


13.4-4. Standard Errors of Estin 


Find the standard errors of esi 
Sol. The eq. of line of regress 
Vey? = aa 

o; 

Let (x; yj), i= 1, 2 ... n be the 


Let Y; y- 


The standard error of estimate 
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ues 1, 2 is zero 


ag 
18 —. 
2 


b 
»5 1, 2, 3 is zero and the conditional 


ole Sle al- 
ale ole Ble ale 


o 
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Cal. of Covariance 


Ree 1/16 7/48 13/48 25/48 beh 10/4 | 30/4 |420/48 
= 1/16 14/48 39/48 | 100/48 |156/48 


16 | 28/48 ee 400/48 |548/48 ae a ae 
87/48 | 308/48 [420/48 Fo ff 


ie., EAE 48a + 156b 
and 420 = 156a+ 548b 
, a = 0-13, b=0-73 
= 5. of line of ane of x on y is 
= 0:13 +0-73y. 
13.4-4. Standard eee of Estimate 


Find the standard errors of estimate of y and x respectively. 
Sol. The eq. of line of regression of y on x is 


ae 
isk 


= iL 
-y= (x-x) 
y-y o2 


Let (x, yj), i= 1, 2 ... m be the variate value pair occurring with frequency f,. 


Let y= y+ Hh (x,-3) 


The standard error of estimate of y is given by 


1 n 
Ss? = —Y AM-y 
N i=] 
n 
where N= > Si 


2 
en ey lle eee, ee ae 
= Pe (x; -x)-(; m} 


1< Lt ; Hi 
= re aeL [es] (x;- x)? +(;-9)? -2 a (x;-x)(; ca 
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2 2 
= FU yg? —2 Eu 
Ox Oy 
2 
2 Hi 2 
= oy 41-|—— 7 1-° 
y [| ©," ( ) 


a S, = o,(1-7)” 
Similarly standard error of estimate of x is given by 
S, = 6, (1-P)!? 

Note. If r = +1,5,=S,=0 

.. All points lie on both lines of regression and hence two regression lines coincide and 
thus there is a linear functional relation between the variates x and y. 

Asr > 1, Se — 0 and Ss > 0, Le, as r comes nearer to unity, the points are closer 
to lines of regression which are nearer to coincidence. 

‘. The departure of 7° from unity can be taken as a measure of departure of the 
relationship between the two variates from linearity. 

Ex. 13-43. For a given bivariate dist. find the straight line for which the sum of the 
squares of the normal deviations is minimum. 

Sol. Consider the bivariate dist. 


XD (XP XQe ee, Xp 
VP | VyV2 vvererees Yn 
SD MA fa ccccvce Tn 
and let the equation of the straight line be 
xcosatysina—p=0 (1) 


The normal deviation of an observed value pair (x;, y;) from the line is the length of 
perpendicular from the point (x;, y;) upon the line i.e., x; cos & + y; sin & — p. 


n 
Let 5 = > fi (xj cos a +y; sin a—p) 
i=] 
Normal eqs. are 
oS ; . 
0 = oe = > 2f; (xj cos & + y; Sin & — p) (—x; Sin & + y; Cos O) .(2) 
i=] ; 
- Oo < 
and 0 = bp = > — 2f{x; cos & + y; sin & — p) (3) 
i= 
Eqs. (3) and (2) are equivalent to eqs. 
xXcosat+ysina=p (4) 


n - n us 
and cos o& sin & {3 fie - >. fe +cos2a >) fiXi di 
i=l i=l 


i=] 


n a 
+p fine fix; — cosa > wn. =0 
i=l i=l 


Le, cos o sin a{(6, + 77)-(02 + ¥7)}+cos2a(y1 +yx)+ pxsina—pycos a=0— 


ie., {cos & sin a (oy - 02) + cos 2 & Hy} + {cosa sina (7° -x7) 


+ ¥.7(cos” a—sin* «)+ p¥ sina. — py cosa} =0 


CORRELATION CO-EFFICIENT AND LI 
prs actrees ior nese eee ea 


: sin2a /_2 _2 
ie., age (05-03) +cos: 


tan2a = 


Eq. (5) gives two values of a. : 


The corresponding values of p 
the equation of the required line. Ev 


13.5. Correlation Ratio 


Def. Consider the case when 
more than one value of (say yj). L 


Let y= 12 
J 
Then correlation ratio of y on: 
= \2 
YY HOy-Fy = 
ij 
where >. >. fy=N. 
ij ; 
Theorem. Show that 
rst 
Proof. Evidently ae <1. 


To prove 7” < Ty first the equ 
be y=a+ bx. 
The unknowns a and b are giv 


dD fivy=Natb > 
ij t 
and Zz pD frxivy= ad, > 
a | tod 


Leé., y= at 
and U,t+x-y = ax 
Hy 

b= 7 

0; 


Eq. of line of regression 


Dpcecee 
yy. o 
Let y= y+ 
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5y G- r) 


e two regression lines coincide and 
iates x and y. 
earer to unity, the points are closer 


as a measure of departure of the 


tight line for which the sum of the 


.(1) 
yj) from the line is the length of 
COS OL + yj SiN & — p. 


a — py’ 

-x; sin 0 + y; Cos 0) 2) 
QB) 
AA) 

ot 2 SiXi i 


o(uy, +p X)+ px sina — py cos a=0 : 


20S O sin & (7° -x*) 


a. —sin? 0.) + pX sina ~ py cosa} = 0 ] 
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in2 = ae a 
i.é., [ena (c3-02)+-cos2cu.n13 + {ycosa(ysina+xcosa—p) 


— xsina(xcosa+ysina—p)} =0 


ie., aon (05 - o;) +cos2Q. py, =0 . [using (4)] 
tan2q = —HU 5) 
C7 Gy 


Eq. (5) gives two values of «. If one is 8, the other is = +0. 


The corresponding values of p are given by (4). With these values of o and p, (1) gives 
the equation of the required line. Evidently there are two such lines which are perpendicular. 


13.5. Correlation Ratio 


Def. Consider the case when corresponding to any given value of x(say x;) there are 
more than one value of y(say yj). Let the pair (x;, yj) occur with frequency fj. 


Let yi = P ios] / > fy 
j j 


Then correlation ratio of y on x(,,) is defined by 
ania 
2 ~ fg Vg Fi = N 03 (1-13) 
where >. > fy=N. 
iy : 


Theorem. Show that 


Proof. Evidently Nyx <1. 
To prove r? < Thee first the equation of line of regression of y on x will obtained. Let it 


be y=at bx. 
The unknowns a-and b are given by 


pa 2 fix yy= Nat bY pa SiiXi 
I J i a 

md DE fos a E fob Bsa 
tos a | i fj 


Le, y = atbx 
and by +E-¥ = ax+b(oz +z") 
b= ae and a= pf x 
Ox Oy 
Eq. of line of regression is 
Hit “i 
_ => TF x- 
YY. o2 & x) 
Let y, = pret (x, -¥) 
; G 
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2 
Dy dD fy og- WL D fi {ov- DF -»| 
if Oy 


2, Bi Bi 
= N| oy + => - 2-3] =No, (1-7) 
O* O*% : 


Now YL fiov- WY = XL LV HpOy-H 
ij ij 
i.e., the sum of square of deviations in any array is least when they are measured from the 
mean of the array 
Noy (1-r) = No, (1- nye) 

which implies r 

: 2 
P< ny Si 


Note. Similarly as above correlation ratio of x on y (N),,) can be defined and it can be 
shown that 


1A 


2 
Nyx 


IA 


y < mi <i. 

Ex. 13-44. Show that the correlation ratio of y on x is the ratio of the standard deviation 
of the weighed means of the arrays of y 5 (weighed by the corresponding array frequencies) 
to the standard deviation of all y’s of the dist. 

Sol. Let yj (j = 1, 2, ...) be the values of y corresponding to x = x; and fj be the 
frequency of the pair (x; yj). 


Now 


& 
N 
| 


=> fy 0 -PY where y = A.M. of y 
ler 


J 
LX fi y-H)+Gi-PY 
ij 


where y, = [x ap fi 
J F 


> > fy Wy -¥i)? +> > Fj G-YY 
ij if 
+2) > fy Wy -Vi) Gi-¥) 
ij 


N 05 (1-Nyx) + > 0; (9; -¥)* 


i 


x 
| 


where n= > fi E Dy fy (Vy ~F,) = UF, — 4; = 0 
| 1 2 2_ on 
2 5s my 
Myx = Vy du UVP) ¢ | oy => 
ee N i | oy 


| 1 ye 
where Ony = We ni(¥i-¥)” is the s.d. of the weighed means of 
F 


arrays of y’s (weighed by the corresponding array frequencies). 
Ont 
Ox 


Note. For correlation ratio of x on y, ney = 


1. 


we 


10. 


CORRELATION CO-EFFICIENT AND I 
bee ae en 


Calculate correlation co-effi 
x: 5 15 10 
y 21 14 28 
x: 188 19-1 17-6 
y 7:8 7-6 77 


. Husband’s age (x) 20 


Wife’s age (y) 14 


Husband’s age (x) 24 9.27 
Wife’s age () 18 20 


eS & 
— 
No 
— 
nN 
— 
Oo 
— 
b 


y: 23 34 33 34 
. From the index numbers giv 


Months: May June J 
(in 1994) 


Index no. 

of pricesin 169 182 1 
Kolkata (x) 

Index no. 

of prices in 204 222 2 
Mumbai (y) 


. Obtain the co-efficient of c 


city during the period 1990 


Year 1990 1991 
Male - 33 23 
death rate 
Female 45 31 
death rate 

. Calculate the correlation c« 
below : 
Marks (in 15 13 
Exam. A) x 
Marks (in 18 16 
Exam. B) y 


The table below shows the 
vehicle accidents in a city. 

Year 1995 1996 
No. of 2:6 2:8 
Vehicles with 

licences (’000) 

No. of Motor 5-9 6:0 
vehicles 

accidents (’00) 
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2 
HL o-9| 


Jy 


=No, (1-/) 


when they are measured from the 


'(Nxy) can be defined and it can be 


s the ratio of the standard deviation 
: corresponding array frequencies) 


esponding to x = x; and fj be the 


there y =A.M. of y 


yy? 


> YH Gi-y 
ij 
+2> ¥ fp 0y-F) Gi-7) 
ij 
(9; -¥)? 


Vg Vi) = Yi —n,y, =0 


the s.d. of the weighed means of 


uencies). 
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1. 


10. 


EXERCISES 


Calculate correlation co-efficient for the following datas : 
x 5 15 10 20 25 40 


y: 21 14 28 7 35 42 ‘[Ans. 0-49] 
x: 18-8 19-1 17-6 16-8 18-2 19-5 20:00 21:8 21-9 
y: 78 7-6 7-7 75 7:8 7-2 8-0 7-9 7:8 
[Ans. 0-37] 
. Husband’s age (x) 20 30 40 50 60 70 80 
Wife’s age (y) 14 5 30 32 40 45 65 
[Ans. 0-94] 


. Husband’s age (x) 24 27 28 28 29 #30 32 33 «35 «©35~=«640 


Wife’s age (y) 18 20 22 25 22 28 28 30 27 30 22 


[Ans. 0-5 ] 
x: 20 18 16 15 = 14 12 12 10 8 5 
y: 12 16 10 14 = 12 10 9 8 7 2 ~~ [Ans. 0-87] 
x: 28 41 40 38 35 33 40 32 36 = =8©633 


y: 23 34 33 34 30 26 28 31 36 38 ~~ © [Ans. 0:44] 


. From the index numbers given below, find Karl Pearson’s co-efficient of correlation : 


Months: May June July Aug. Sep. Oct. Now. Dec. Jan. Feb. 
(in 1994) 


Index no. 
of prices in 169 182 182 192 198 211 227 238 350 =. 253 
Kolkata (x) 
Index no. 
of pricesin 204 222 255 228 231 233 249 266 255 255 
Mumbai (y) 


[Ans. 0-74] 


. Obtain the co-efficient of correlation between male and female death rates in Delhi 


city during the period 1990-97. 


Year 1990 1991 1992 1993 1994 1995 1996 1997 

Male 33 23 24 28 27 28 22 24 

death rate 

Female 45 31 33 40 35 39 32 34 

death rate [Ans. 0-97] 
. Calculate the correlation co-efficient between the marks in two examinations given 

below : 


Marks (in 15 13 17 14 #18 «120 «6.2006«©160¢6«618~«617)—C«(«19)s 21 
Exam. A) x 

Marks (in 18 16 18 15 19 16 18 15 21 #17 «18 = 20 
Exam. B) y [Ans. 0-703] 
The table below shows the number of vehicles with licences and the number of motor 
vehicle accidents in a city. Calculate the co-efficient of correlation : 

Year 1995 1996 1997 1998 1999 2000 2001 2002 
No. of 2:6 2°38 2-9 3-1 3-2 2-3 2:5 1:8 
Vehicles with 

licences (’000) e. 

No. of Motor 5-9 6-0 6-2 6-2 7-6 7-0 7-4 5-5 
vehicles 

accidents (00) [Ans. 0-366] 


544 


11. Calculate the co-efficient of correlation between cotton and woollen cloth manufacturer 


12. 


13. 


14. 
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from the following data : 


Months July — Aug. Sep. Oct. Nov. Dec. 
Index no. of cotton 103 105 108 106 104 102 
cloth manufacturers(x) 

Index nos. of woollen 75 73 78 71 80 76 


cloth manufacturers(y) 
Jan Feb. March April May June 
108 115 118 114 116 120 
68 65 62 60 58 54 
[Ans. 0-909] 
Calculate the co-efficient of correlation between the production of rice and its price 
from the following table : 


Production 250 270 278 325 260 510 428 320 440 310 
Price 84 50 62 75 90 170 136 65 72 58 
[Ans. 0-74] 
Find the correlation co-efficient of datas given below : 
yo 30-35 35-40 40-45 45-50 50-55 55-60 
x 
80-90 2 3 2 — — — 
90-100 — 2 5 4 2 — 
‘100-110 _ 4 8 5 1 _ 
110-120 © — — 2 3 1 1 
120-130 1 — — 2 1 1 
[Ans. 0-43] 
yo 18 19 20 21 22 Total 
x 
0-5 — — — 3 
5-10 = _ — 3 2 
10-15 — — 7 10 — 17 
15-20 — 5 4 — — 9 
20 — 25 3 2 — — — 5 
Total 3 7 11 16 3 40 


[Ans. 0-837] 


15. Ages of daughters (in years) 


5—10 10—15 15—20 20—25 25—30 Total 
S-. « 16295 6 3 = a = 9 
Se 23-35 a ce 
g S  35—45 — 10 15 7 a 32 
Se 45—55 ee 7 10 4 21 
x 55—65 — = = 4 5 9 

9 100 


Total ~ 9 29 32 21 
[Ans. 0-802] 


S 
Z 
“hh 
4 
= 
ay 
’ 
e 
Bi 
é 
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bi 
4 
4 
i 
& 
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s 
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Ne 


16. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 


26. 


re 0-—20 
= 0-20 32 
& 20—40 45 
= 40—60 16 
~ 60—80 = 
S 80—100 — 
= Total 93 


The following table gives the 
Calculate the correlation co 


wm 
S 10—20 
"3 10-20 20 
> 20—30 8 
3 30—40 ee 
<  40—50 = 


Construct examples of at lea 
equal to— 1,0 and + 1. 

Two independent variates 
respectively. Show that the 
The variables x and y are cc 

ax +by+c=0 

Show that the correlation cc 
are alike and + 1, if signs ai 


- The independent random vz 


(x) = 4ax 0 <x S$) 
= 0 otherwise 


Find the correlation co-effic 


(a) Show that 
var (x + y) = var(x) + var 
provided x and y are uncorr 
(b) Show that 
ry > or < 0 accord 
If x be the A.M. of x inder 


re) 
var (x) = —.- 
Nn 


If u = ax + by, v= ax — by 
measurements by the same 
yisr. If u and v are uncorre 


Oy = 2ab 6,0, 4 
Ifx, x2, x3 are three variabl 
are uncorrelated, obtain the 
X1, X2 .... X, are random va 
efficient between any two ¢ 
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Pepe a ae 


ynand woollen cloth manufacturer 


ep. Oct. Nov. Dec. 
08 106 104 102 


78 71 80 716 


rch April May June 
118 114 116 120 
62 60 58 54 
[Ans. 0-909] 
1e production of rice and its price 


510 428 320 440 310 
170 136 65 72 58 

[Ans. 0-74] 
aw: 


ple ee ee 
45-50 50-55 55-60 


4 2 — 
5 1 — 
3 1 1 
2 1 1 
[Ans. 0-43] 
ee ot oe 
21 22 Total 
3 1 
3 2 
10 = 17 
a a 9 
= _ 5 
16 3 40 


[Ans. 0:837] 


—25 25—30 ‘Total 


—_ — 9 
— — 29 
7 — 32 
10 4 21 
4 5 9 
21 9 100 


[Ans. 0-802] . 4 
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16. 


Marks in History 


- 0—20 20-40 40-60 60—80 Total 
= 0-20 32; ~—Ss«88 15 os 135 
& 20-—40 45 436 200 4 685 
= 40—60 16 500 398 25 939 
2 60—80 ae 105 532 40 677 
S 80—100 — 8 40 16 64 
= Total 93 1137 1185 85 2500 
[Ans. 0-048} 


17. The following table gives the ages of husbands and wives at the time of their marriages. 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 


26. 


Calculate the correlation co-efficient between the ages of husbands and wives : 
Ages of wives 


3 10—20 20-30 30-40 40—50 

"2 10—20 20 26 = — 

= 20—30 8 14 37 — 

§ 30—40 a 4 18 6 

S  40—50 ae _ 4 3 [Ans. 0-69] 


Construct examples of at least 5 pairs of observations with co-efficients of correlation 
equal to- 1, 0 and + 1. 
Two independent variates x and y have means 5 and 10 and variances 4 and 9 
respectively. Show that the variates u = 3x + 4y, v = 3x — y are uncorrelated. 
The variables x and y are connected by the equation 
axt+by+c=0 

Show that the correlation co-efficient between them is — 1, if the signs of ‘a’ and ‘b’ 
are alike and + 1, if signs are different. 
The independent random variables are defined by 
Six) = 4ax 0 <x sr JO’) = 4by 

=0 otherwise =0 


0 sy Ss 
otherwise 


Find the correlation co-efficient between x + y and x — y. 


Ane. a | 
b+a 
(a) Show that 
var (x + y) = var(x) + var(y) 
provided x and y are uncorrelated. 
(b) Show that 
ry > or <0 according as Oy+y > OF < Oy. 


If x be the A.M. of n independent variates x1, x2 .... X, each of s.d. 6, show that 
2 
var (x) = a 
n 


If u = ax + by, v= ax — by, where x, y represent deviations from the means of two 
measurements by the same individuals. The co-efficient of correlation between x and 
y is r. If u and v are uncorrelated, show that 


6,0, = 2ab 6,6, yl a 


If. x1, x2, x3 are three variables with s.ds. 61, 62, 63 respectively. Ifany two of variables 
are uncorrelated, obtain the co-efficient of correlation between x, + x2, and x2 + x3. 
X1, Xp «... X, are random variates each with mean yt and s.d. o. The correlation co- 
efficient between any two of them is p. Show that 
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27. 


28. 


29. 


30. 


31. 


32. 


33. 


34, 


35. 


36. 


MATHEMATICAL STATISTICS 


2 
var(x) = 2 4 ( = i) po- 
n n 


= Xp FX + we +Xy, 
where x = ———*—_———*. Deduce that 
n 
. 1 
p n-1 
x and y are random variates with zero means and unit variances. If 
a 1+2ab 
r(ax + by, bx + ay) = ZF 
( ry 12 aie ES) 
find r(x, y). 


Find the co-efficient of correlation between x and y for the following table : 


yo a) y2 Total 
x4 
i = ap ee Be 
x2 P21 P22 Q 
2 ee Soa | Se (pee 


eee 
x, and x2 are two variates with variances 61’ and 0,’ respectively and r is the co- 
efficient of correlation between them. Determine the value of the constant k such that 
Ans. k = — 2L 
2 
x, and x2 are independent variables with means 5 and 10 and s.ds. 2 and 3 respectively. 
Obtain r (u, v) where u = 3x, + 4x2, v = 3x) — xp. [Ans. r(u, v) = 0] 
Let u = ax + by, v = bx — ay where x and y represent deviations from their respective 


oj 
xX, + kx, and x; + ee X2 are uncorrelated. 
: 2 


- means. If the correlation co-efficient between x and y is p and u, v are uncorrelated, 


show that 
(i) ab (6,’ - 6,”) = po,0, (a’ - B’) 
(ii) 6,’ + 6) =(@ +B’) (67 +6,’) 
(See Ex. 13-14) 
A coin is tossed 7 times. If x and y denote the number of heads and the number of tails 
turned up respectively, find p(x, y). [p is correlation co-efficient]. 
If x, y, z: are three variates each having mean 0, variance 1 and the correlation co- 


. rete 1 : : 
efficient between any two variates is r, show that r >— - What is the corresponding 


result for 1 variates? (Hint. See Ex. 13-19 and Ex. 26) 


Two judges in a beauty contest rank the ten competitors in the following orders : 

6 4 3 1 2 7 9 8 10 5 

4 1 6 7 5 8 10 9 3 2 
Calculate the co-efficient of rank correlation. [Ans. 0-22] 
The ranks of the same 15 students in Mathematics and English were as follows, the 
two numbers within brackets denoting the rank of the same student : 

(1, 10), (2, 7), (3, 2), (4, 6), (5, 4), (6, 8), (7, 3), (8, 1), (9, 11), (10, 15), (11, 9), (12, 5), 
(13, 14), (14, 12), (15, 13) 

Find the rank correlation co-efficient. [Ans. 0-5] 


The co-efficient of rank correlation is 0-8. If the sum of the squares of the differences 
in ranks is 33, find the number of individuals. [Ans. 10] 


37. 


38. 


39. 


40. 


41. 


42. 


43. 


44, 


45. 


CORRELATION CO-EFFICIENT AND 


The table below shows the : 
65 63 67 64 

68 66 68 65 

find co-efficient of rank coi 
For the following data, find 
xe 5 15 10 
y: 21 14 28 


Obtain the lines of regressii 
x: 1 2 3 
y: 9 8 10 
Deduce the value of correl: 
should correspond on the a’ 


Determine Karl Pearson’s 
given in the following table 


Exports : 45 46 
Imports : 94 96 
Obtain also regression equé 

[Ans. 0-96 


Mean soil temperature and 5 
above ground) for winter w 
Mean soiltemp.: 57 42 
No. of days 10 2¢ 
Obtain the regression equat 


Calculate the co-efficient o: 
two tests : 


Student >: A ECE 
Test I : 50 5é 
Test II 22 23% 


Also obtain equations of lu 


The following table gives tk 
and B in an examination : 


Marks in A 


30—36 
30-39 3 
40—49 2 
50—S59 1 
60—69 — 
Total 6 


Calculate the co-efficient o 


The following regression e: 
x = 0-8456y + 5-4! 
y = 0-7326x + 35-! 
Find the values of (7) the cc 
For two variables x and y 


y=axt+bandx=aytB.: 


MATHEMATICAL STATISTICS 
ee 


t variances. If 


for the following table : 
Total 


P 


Q 
1 


5)” respectively and r is the co- 
value of the constant & such that 
a 
Ans. k = —~ — 
O2 
10 and s.ds. 2 and 3 respectively. 
[{Ans. r(u, v) = 0] 
deviations from their respective 
y is p and u, v are uncorrelated, 


c of heads and the number of tails 
co-efficient]. 
iriance 1 and the correlation co- 


1 : ; 
Ss What is the corresponding 


Hint. See Ex. 13-19 and Ex. 26) 
itors in the following orders : 


9 8 10 ‘ 
10 9 3 2 
[Ans. 0-22] 


and English were as follows, the 
he same student : 
»(9, 11), (10, 15), (11, 9), (12, 5), 


[Ans. 0-5] 


1of the squares of the differences- 


[Ans. 10] 
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37. The table below shows the respective heights of 12 fathers and their eldest sons 
65 63 67 64 68 . 62 70 66 68 67 69 71 
68 66 68 65 69 66 68 65 71 67 68 70 
find co-efficient of rank correlation. 
38. For the following data, find the lines of regression : 
x: 5 15 10 20 25 30 
y: 21 14 28 7 35 43 
{Ans. y = 12-6 + 0-68x, x = 9 + 0:347y] 
39. Obtain the lines of regression for the following data : 
x: 1 2 3 4 5 6 7 8 9 
y: 9 8 10 12 11 13 14 16 15 
Deduce the value of correlation co-efficient and also obtain an estimate of y which 
should correspond on the average to x = 6-2. 
[Ans. y = 0-95x + 7:25; x = 0-95y — 6-4; 0-95; 13-14] 
40. Determine Karl Pearson’s co-efficient of correlation between exports and imports 
given in the following table : 
Exports : 45 46 48 50 52 53 51 49 47 
Imports : 94 96 98 100 104 105 102 99 97 
Obtain also regression equations and standard errors of estimate of x and y. 
{Ans. 0:99; y = 1-333x + 34-111; x =0-739y — 24-511, 0-31, 0-42] 
41. Mean soil temperature and germination interval (time between sowing and appearance 
above ground) for winter wheat 1991-96 for 12 places are recorded below : 
Mean soiltemp.: 57 42 38 42 45 42 44 40 46 44 43 40 
No. of days >: 10 26 41 29 27 27 #19 18 19 31 29 33 
Obtain the regression equation of germination interval on mean soil temperature. 
[Ans. y = 80-752 — 1-262x] 


42. Calculate the co-efficient of correlation between the marks secured by 12 students in 
two tests : 
Student >: A B-C D E F G H I J KL 
Test I : 50 54 56 59 60 62 61 65 67 71 71 «74 
Test II > 22 25 34 28 26 30 33 30 28 34 36 40 


Also obtain equations of lines of regression. 
[Ans. 0-774; y = 0-538x — 3-125, x = 1-:115y + 28-493] 
43. The following table gives the number of candidates obtaining marks in two subjects A 
and B in an examination : 


Marks in B 
Marks in A 30—39 40—49 50—59 60—69 Total 
30—39 3 1 1 — 5 
40—49 2 6 1 2 11 
50—59 1 2 2 1 6 
60—69 — 1 1 1 3 
Total 6 10 5 4 25 


Calculate the co-efficient of correlation. Obtain also the lines of regression. 
[Ans. 0-39; y = 0-43x + 26-961; x =0-361y + 30-225] 
44. The following regression equations are obtained from a correlation table : 
x = 0-8456y + 5-45 
y = 0-7326x + 35-86 
Find the values of (i) the correlation co-efficient (ii) the mean of x and y. 
45. For two variables x and y with the same mean, the two regression equations are 
b t-a 


y=ax+bandx=ay + B. Show that B as Find also the common mean. 
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46. 


47. 


48. 


49, 


50. 


51. 


52. 
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Re * q 


Given the regression lines 2y — x = 50, 3y — 2x = 10. Show that the estimate of y for: 


x = 150 is 100 and the estimate of x for y = 100 is 145. Explain the difference. 
Criticize the following : ° 


Dyy = 3-2 and by, = 0:8 


Show that sin 6 < (1 — 77) where 0 is angle between lines of regression. 
l-r? 6,6, 
Hint. We have tan 6 = a en 
r  O% +0, 
2 2 2 
Now Se ote (Gyn0))- >0 
20,0, 20,.0, 
Ox Oy ~ 1 
or tos 2 
1- 2 
tan 6] < —— 
2Ir| 
2 
jeot o> UE 
Ve 
4 4r? (l+r?)? 
1+cot’ 6 2 Le oe 
(l-r*) (l-r*) 
1 2\2 
cosec? 9 > UT 
(l-r*) 
‘ 1-r? bs 2 
sin 0 < 7 $(1-?) (CS 1+P21) 
l+r 


If the lines of regression of y on x and x on y are respectively 
axtbiyt+c,=0 

and ax + boy + 0, =0 

show that aibz < ay by. 

The two lines of regression are given by x + 2y = 5 and 2x + 3y = 8, calculate 

(i) the values of X and y 

(ii) the co-efficient of correlation. 

Show that, if the random variables x and y have the joint p.df. 


x+y O<x<l, 


0<y<l 
x, y) = 0 


elsewhere 


. 1 
then the correlation of x and y is — i 


The random variables x and y are jointly normally distributed and u, v are defined by 
u=xcosatysina 
v=ycos a—x sina 

Show that u and v will be uncorrelated, if 


2p0,6 
tan 2a = SOxOy 
Ox — OF 
where p = correlation co-efficient between x and y. 00 


Multiple and 


14.1. Introduction 


Here the theory of correlation inv: 
aim of the theory of multiple correlat 
upon a variable not included in the gr 

In case, the study of relationship t 
remaining variables on these two var 
eliminate the entire influence, only the | 
the two variables is called partial cor 

Here only three variables will be 


14.2. Notations 


Let x}, x2, x3 be the variables. It is 
respective means, so that 
X= XQ 
The multiple correlation coeffic 
(independent variables) is denoted b 
independent variables and figure befor 
the multiple correlation between x, anc 
The partial correlation coefficient 
after dot refer to variable whose effect 
denote the partial correlation between. 
14.3, Plane of Regression 
The equation of plane of regressio 
; Xj} = ati 
where a, 51.3 and b)3.> are constants, Thi 
coefficients of x, and x, for fixed X3 % 
subscript attached to the b’s is the subs« 
and the second subscript is that of x t 
primary subscripts. The subscript sepi 
of x which has been left. These are call 
Now (1) => 
x} =atl 
(1) takes the form x; = bj3. 
Here the coefficients b’s are to be « 
S = Ly 
which is the sum of the squares of the r 
the variables. 
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: 10. Show that the estimate of y for 


is 145. Explain the difference. 


een lines of regression. 


+P 21) 


e respectively 


= 5 and 2x + 3y = 8, calculate 


: the joint p.d.f 


lly distributed and u, v are defined by | 


nd y. 


oo | 


[a 


Multiple and Partial Correlations 


14.1. Introduction 


Here the theory of correlation involving more than two variables will be discussed, The 
aim of the theory of multiple correlation is to study the joint effect of a group of variables 
upon a variable not included in the group. 

In case, the study of relationship between only two variables is to be made, the effect of 
remaining variables on these two variables should be eliminated. As it is not possible to 
eliminate the entire influence, only the linear effect is eliminated. Then the correlation between 
the two variables is called partial correlation. 

Here only three variables will be taken. 

14.2. Notations 

Let x1, x2, x3 be the variables. It is assumed that these denote the deviations from their 

respective means, so that 
x) _ x2 = X3 =0 

The multiple correlation coefficient between x, (dependent variable) and x2, x3 
(independent variables) is denoted by Rj.23. Where figures after dot (.) correspond to 
independent variables and figure before dot refer to dependent variable. Thus R».)3 denote 
the multiple correlation between xz and x1, x3 and so on. 

The partial correlation coefficient between x, and x2 is denoted by 712.3. Where figures 
after dot refer to variable whose effect has been eliminated or is kept constant. Thus 723. 
denote the partial correlation between x» and X3. 

14.3, Plane of Regression 
The equation of plane of regression of x; on x», x3 is of the form 

Xp = 44+ dj73 x7 + by32 x3 (1) 
where a, by>.3 and b;3.9 are constants. The quantities 17.3 and b;3. are called partial regression 
coefficients of x; and x» for fixed x; and of x, On x3 for fixed x2 respectively. The first 
subscript attached to the b’s is the subscript of the letter on the left (the dependent variable) 
and the second subscript is that of x to which it is attached. These subscripts are called 
primary subscripts. The subscript separated from the primary subscripts by a dot (.) is that 
of x which has been left. These are called secondary subscripts. 

Now (1) = . 

x, = at bis 2 +5132 3 > a=0 
(1) takes the form xy = by23 X2 + by3.2 x3 ..(2) 
Here the coefficients b’s are to be obtained so as to minimize 
S = X(xy — by2.3 x2 — by3.2 x3)” 
which is the sum of the squares of the residuals, the summation is over the given values of 


the variables. 
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The normal equations are 
os 
0 = = —22xp (x1 — 512.3 X2 — 513.2 3) 
0b) 23 
> Xxx. = by73 Ex + by3.2 Lxy x3 
=> N72 01 O2 = by2.3 nO" + by3.2 nrz3 6203 
=> 1120) = by2.3 62 + by3.2 123 63 .(3) 
os 
and 0= = 22x3 (x - by23 X2- 533.2 x3) 
0b 13.2 
2 
=> 2x1x3 = by7.3 Lxox3 + by3.2 D3 
> 1130) = By2.3 62 723 + B32 03 (4) 
Solving (3) and (4) 
A201 1393 2 193 
30) 03 = Oo 3 1 
bi23 = eae | ee | A oe 
92 2393] Gy 23 
302 93 m3 1 
1 72 
G) 1230 «73 
and by3.2 = 53 1 193 
13 1 
For convenience and simplicity, let 
Al 42 ~=43 
MO = |, 22 123 
31 132 133 
and observe Ny = 2 =133= 1 


= 
N 
| 


= 121,113 = 131,123 = 132 


Let 4 = cofactor of (i, /)th place. 


_ j2 m3, _| 1 13 
Then On = one m3 1 
Ly | |f2t 23 2 123 
O12 Tes 13 301 
ee) 72 
ae m1 2) [3 AB 
gS She i Sele 335) 
; Oo; Ol O30 11 


Substituting these values in (2), Eq. of plane of regression of x; on x2, x3 is 


Oo, @ fom] O13 
x, = panic yeas 2 x2 -—" x3 
02 O11 63 O11 
x x x 
Lé., : A opt op papa k @)3 = 0. ...(6) 
o| 82 03 


Similarly eqs. of planes of regression of x2 on x), x3 and x3 on X1, x2 respectively are 


MULTIPLE AND PARTIAL CORRELATIONS 
ph sae mars cal ese aecn diaacsene or eae 


and —~ 03 
Remark. Eliminating b,>.3 and b, 
x] 
A209) 
39] 
x} 
OL 
ié., 2 
n3 71 


which is the eq. of plane of regression | 

Remark. (1) x, x2, x3 are also cons! 
Then value of x, as estimated by plane 
denoted by €.23. Thus 


E423 = dy23 


Similarly 
€213 = ba3 
€312 = b3i2 
The difference x, — € }.23 is the resi 
Thus Xy.23 = X- 
= Xo 
Similarly X43 = XQ-, 
3.12 = X3-5 


(2) In a quantity the subscripts be 
those after dot are called secondary sut 
any order but the order of primary subs: 
left refers to the dependent variable anc 

The order of the quantity is deterr 
Thus x}.23 is of order two, b;.3 is order 

Ex. 14-1. Using the following dat 


x, = 40 
6, =3 
ry2 = 0-4 


find the equation of plane of regression 
30, x3 = 40. 
Sol. We have 


"3 
Eq. of plane of regression of x2 on. 
mal 
— 
S| 21 
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—by2.3 %2 ~ 913.2 *3) 


2X3 
23 0203 
3 (3) 
.— by3-2 X3) 
x3 
33 (4) 
A2 123 
o |/73_ 1 
(oy) 1 13 
Lok} 1 
= 132 
13 
1 
H2 123 
"3 1 
1 72 
3 443 
6, 4 
32 =-— (5), 
03 911 
if regression of x, ON X2, X3 1S 
Oo; @ 
i 21 x%3 
93 011 
x 17 
y+ #3: O13 =0. . HY 
: 03 E 


.X1, X3 and x3 on X41, X2 respectively are 4 
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xy a: x 
—~— 091 2 Wing Female 3 = 0. (7) 
oO} 02 03 
x Xx x 
and ad 03, + wate 032 + aes (033 = 0. (8) 
o| 02 93 


Remark. Eliminating by2.3 and b;3.. between (2), (3) and (4). 
Xp X x3 
M25, 52 19303] = 0 
A391 92723 93 


x) 2 3 
o, 82 03 

ie, m2 1 3/=0 
y3 30 1 


which is the eq. of plane of regression of x, on x, x3 in determinant form. 

Remark. (1) x1, x2, x3 are also considered as the observed values of variates respectively. 
Then value of x; as estimated by plane of regression is 17.3 x2 + 513.2 x3. Let this value be 
denoted by €).23. Thus 

€ 1.23 = by2.3 x2 + by3.2 x3 
Similarly 
E213 = 521.31 + b3.1 x3 
€3.12 = D321 + b324 x2 
The difference x; — € ;.23 is the residual of x;. It is denoted by x).23. 


Thus X4.23 = X1 — € 1.23 
= x1 — by2.3 x2 — b13.2 %3 
Similarly X23 = X2— 1.31 — by3.1 X3 


X3.12 = 2X3 — b31.2%1 — 532.1 X2. 

(2) In a quantity the subscripts before dot (.) are known as primary subscripts and 
those after dot are called secondary subscripts. The secondary subscripts can be written in 
any order but the order of primary subscripts is important. First primary subscript from the 
left refers to the dependent variable and other to independent variable. 

The order of the quantity is determined by the number of secondary subscripts in it. 
Thus x}.23 is of order two, b;2.3 is order one and so on. 

Ex. 14-1. Using the following data 


x} = 40 X2 =70 x3 = 90 
o,;=3 02=6 63=7 
r12 = 0-4 r3 = 0-5 r13 = 0:6 


find the equation of plane of regression of x2 on x; and x3. Also find the value of x2 for x; = 
30, x3 = 40. , 
Sol. We have 
1 Hr 3 1 0-4 0-6 
@ = 142-1 m3/=]0-4° 1 0-55 
hz 3 1 0-6 0:5 1 
Eq. of plane of regression of x2 on x, and x3 is 


x x x 
+o5)+ oy +o; =0 
io] O72 03 
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TE 


Here x), x2, x3 are with zero means. So these are to be replaced by 
X1 —X1,X_ —X_,xX3-X3 
respectively. 
Eq. of plane of regression of x2 on x; and x; is 


Similarly 
Thus “the sum of the product of tv 
as well as secondary) of one occur ar 


MULTIPLE AND PARTIAL CORRELATION 
ee 


2X13.%213 = 0, 


(x) — x) es (x2 —X>) an (x3 -—%3) ars 0 (1) 14.5. Multiple Correlation Coeffici 
oO; roy) 63 Multiple correlation coefficient of 
0-4 0-6 x, and its value is given by the plane 
Now 01 = ale 1 | 0:3 -0-4=-0-1 E,.3 = bia 
ae Let N be the total number of obs« 
@2 = | | = 1-036 =0-64 Now 4.23 = diy 
0-6 1 = 0 
1 4 1 
@23 = ay b3| 7 024-05 =- 026 Var (€ 1.23) = 7 
.. Substituting values in (1), eq. of plane of regression of x2 on x; and x; is _ i 
SS oj ee -_— (0:64) 4 =) _- (-0-26) =0 a 
~ 0-03 (x; — 40) + 0-11(%2 — 70) — 0-04 (x3 — 90) = 0 = by. 
~ 0-03 x1 + 0-11x2 — 0-04 x, -2:9=0 , 

“003 xy — 0-1 1x) + 0-04 x3 + 2-9 =0, = bia, 

Put x, =30, x; =40 

; 0-9 ~0-11 x2 + 16+2-9=0 = or 

0-Lix2 = 5:4 .. X2 = 49-09, 
14.4, Properties of Residuals 
(i) In the derivation of plane of regression of x; on x2, x3, normal equations are ted 
Exp (x1 — by2.3 X2 — by3.2 X3) = 0 5 
and 2x3 (x1 — b12-3 Xz — by 3.2 x3) = 0 _ St 

These equations => Xr) x}.23 = 0 and Lx; x1.23=0 ~ 7; 

Similarly, 2X1 %213 = O and Lx3 %.43=0 > 

2x1%3.12 = O = Lrorx3.12 Zia 

Thus “the sum of the product of any residual of zero order with any other higher order o fi 

residual (having the subscripts of the former as one of its secondary subscripts) is zero.” 

(2) Xxy.2%1.23 = U(X — by2X2) x1.23 ) 
= 2x1X1.23 — by22x2 X4.23 a eels 
= 2xyxX1.23 {using property (1)} a oF, 

Similarly XX} .3X1.23 = 2X 4X1.23 a 2 

Also Ex}.237 = XX}.23 X4.23 = Si 
= U(xy — by2.3 X2 — by3.2 x3) 1.23 O14 
= UN X1.23 — by2-3 Lx. X4.23 — By 3.2 Ex3 X1.23 o? 
= Xx) X1.23 = 

Thus Dxp2 Xy.23 = Exp 3X1.23 = Dey.237 = Ex 1.23 ou 

Similarly =Xx2.1 443° 2X7.3 X2.13 = 249.13 ica =x X2.13 ny 

x31 312 = Ux32 93.42 = Leg = Ly w3.42. where wo = |r, 

Thus, “in the sum of product of any two residuals in which all the secondary subscripts zi 

31 


of first occur among the secondary subscripts of the second, all the secondary subscripts of Zz 

the first can be omitted”. : 
(3) 2v4.2.%312 = U(xy -- by2 X2) X3.42 

i Xx] 3.12 ~ By Dixy 23.42 = 0 


cov (x1, € 1.23) = 
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) be replace 
Similarly 2x}.3 X13 = 0, 2X.1 X3.42 = 0, Xx>.3 X1.23 = 0 etc. 
Thus “the sum of the product of two residuals is zero provided all the subscripts (primary 
as well as secondary) of one occur among the secondary subscripts of the other”. 
(1) 14.5. Multiple Correlation Coefficient 
23 =0 ; Multiple correlation coefficient of. X} ON.X3, x3 is the simple correlation coefficient between 
x, and its value is given by the plane of regression of x; on x3, x3 viz., 
3-0-4=-0-1 €1.23 = by2.3x2 + bi3.2.x3 
Let N be the total number of observations for each variate. 
Now "G23 = by2.3 Xz + by3.5 X3 
)-36 = 0:64 0 
a 2 
4-0-5 =~ 0-26 Var (€ 1.23) Wy & ke 123} 


sssion of x2 On xX; and x3 is 
—90) 


1 
W L{b12.3 x2 + B43. x3}? 


= 2 5 
—— (-0-26) =0 ; 2(x3} > >x3) E(x» 3) 
90) =0 bi2.3 - + bi3.2 ——— + 217.3513. Survie 


2 2 
- SL su) 2% [-2 21s) 2 
92 Ot 63 M1 


9; @ 0; @ 
+ 2[- 1 22 )(-2 1 wee 


M x, x3, normal equations are 57 O14 o3 O11 
2 
eal 2 2 ‘ 
(of, + O]3 + 2793012013) 
=0 Or 


sro order with any other higher order | 
fits secondary subscripts) is zero.” 


— 2 2 
2 ("13723 — M12)" + (r12 13 — 143) 
1 


+ 2723 (r13P23 — 112) ("12 123 — 13)} 


St 25 vad 2 
23 ae {(i3 +13 -2n23%5)(1-rB)} 
{using property (1)} OT 
4 ; 
dey Pee PE 
: — dn +113 -2r2 13 %3} 
I 
3.2 X3) X1.23 : 
1 X1.93 — By3.2 LX3 X1.23 Oj 2 ra) 
ree 3 —— {01-0} = of 41 -—— a) 
O11 O11 
= Lx X}.23 
XX2 X2.13 Al 2 13 
= LN3 13.412 ne where @ M1 2 13 
i i ondary subscripts . ; 
in which all the sec y Pp Ay ye 13 


X3.42 = 0 


COV (X1, € 1.23) = 


D2 aD 
Lp = 745 733 + Qryary3r23. 


1 
— <Xy, a. 
N 1 =1-23 
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1 
W 2x1 (512.3 x2 +-b13.2 x3) 


= by2.3 cov (x1, X2) + b13.2 Cov (x1, X3) 


9) 12 91 13 
a ~ Si @n) 61 O2r12 + | — ~~ —~ | 6103713 
G2 O11 93 M11 


of , 
= ~— {pri + M3713} 
1] 
2 
oO} 
= ae {(r13r23 — M2)ri2 + (n2r23 — 113)r13} 
1 
2 
9] 2 2 
tees {ni + 73 - 2n2n3"%3} 
O11 
o 
2 
a om (2) 
@1] 
> _ o 
R23 = searer 


ns +3 -2n273723 
1-133 
Remark. (1) and (2) => 
Cov(x1, €1.23) = Var (€1.23) 20 
Ri23 2 0 
Also since Ria: .23 is simple correlation coefficient, Rj.23 < 1 
0S Ryw3Sl. 

Ex. 14-2. Three variables have in pairs simple correlation coefficients given by 
r12=-0°8 r13 = 0:7 123 =- 0-9, 

Find the multiple correlation coefficient R.23 of x, on x2 and x3. 

Sol. We have 


1 42 3 1 0-8 -0-7 
wm = |n2 1 n3\= 0-8 1 -0-9! =-07 
"3 «+4193 1 -0-7 -0-9 1 
1 -0-9 
O11 = 0.9 1 =1- 0-81=0-19 $ 
07 2 
R23 = pee ee 
O11 -19 19 
R123 = 0:8. 
Ex. 14-3. Show that RPo3 =l- Stas where 6}.23 denotes the s.d. of x;.73. 
9] 
Sol. We have E123 = by2.3.X2 + by3-2 x3 =X] — X23 


1 
Var (€ 1.23) = ae Lfxq — x1.23)° 


on 2 ee 
Sy r {oj +X23 — 2x4 x23} 
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ates esate TCO 


Ria3 = ( 


Ex. 14-4. Show that ree = 


Il 
— 
i 


| 
] 
hw 


Sol. We have Gas) 


Since (,’s are cofactor in @, 
rr}; @ 


23 SG: 
Sigs 


14.6. Partial Correlation Coefficie 


As already defined, the partial c 
between x, and x2 after the linear eff 


Now linear effect of x3 on.x, as | 


6 
linear effect of x3 on x2 is b23 = 193 : 
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3-2 *3) 


143.2 COV (X1, %3) 


i313} 
aria + M1223 — 113)r133 
inanistas} 


(2) 


re 


»Ria Sl 


ple correlation coefficients given by . 


*x1 on x2 and x3. 


1 0-8 -0-7 
0-8 1 —0:9| =-07 


-~0:7 -0:9 1 
-1— 0-81=0:19 
HS ea e8 

19 


}.23 denotes the s.d. of x1.23. 


=X, — 1-23 


~ 2x, x23} 
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1 
= —Uxf + 
N 


i 


1 
= Bxr55 SP De xy 
N 1-23 N 1 41-23 


2 2 1.3 
= OF +9723 Se pele 


2 2 
= O7 ~ 9}.23 


1 
Cov (x1, € 4.23) = > Lexy (x — X1.23) 


N 


Cov(x), €}.23) 


2 
Sp cs iN Se 
1-23 (s.d. of x, )(s.d. of €1.23 


Ex. 14-4. Show that 6739 =O 


Sol. We have ot 


1 (X — Dy2.3 X2 — By3-2 x3) 
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(See Ex. 14-4) 


= 67 — by.23 Cov (x1, x2) — b13.2 Cov (x1, x3) 


= of =n 0; O2 [- 


= of [tens 


o2 


I 
z (F141 Or + 12 O12 + 713 O13) 


1] 


Since @j’s are cofactor in w, 


12 
@ 


: +h 
i 


bod ia: 
02 11 


© 13 
3 
O11 


. 


ryy Ory +112 Oy2 +143 O13 =O 


2 
9}.32 


14.6. Partial Correlation Coefficient 


. 6] 13 
J- meres [- 21 Se 
3 O11 


As already defined, the partial correlation between x, and x2 is the simple correlation 


between x; and x, after the linear effect.of x3 on them has been eliminated. 


. fe . : o1 
Now linear effect of x3 on.x; as indicated by regression of x; on x3 is bj3 =1ry3 ——- arid 
o 


o 


: ; 2 
linear effect of x3 on X2 is 23 =1r23 —~. 
Go 


3 


3 
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a EEE Ee 
9] 
X13 = X1— 113 X3 
83 
62 
and X23 = X2- 193 4X3 
03 


are parts of x), x2 respectively, which remain after the elimination of linear effect of x3. 
Thus partial correlation coefficient between x), x2 is the simple correlation coefficient 
between x).3, X2.3. 


1 
Now Cov (X1.3, X2.3) = N 2X}.3 X2.3 i X13 = X73 =0} 
1 0} 02 
= => x) —3—_X3 || X2-3 —— %3 
N 03 03 
| ; 0} ; 07 
= —> 1X2 —1)3 ~~ X3XQ —193 —~ X13 
N 03 03 
6, Oo 
+ nabs StS a3 é 
63 93 
0} O2 
= Cov (x1, x2) — 113 —~ Cov (x2, x3) — 23 —~ Cov (x1, x3) 
03 03 
0,0 
+ 3723 +4 05 
03 
= 0102 (112 — 113723) 
L 9 
Var (x13) = = UY 
(x1.3) wy UNS 
ae =x).3 Xx 
N13 #13 
: =x 
= — x 
Noo 
! ~ 
= — 2x1] x13 —- x3 | = Oj (1-73) 
Similarly Var (x23) = 0)? (1= 14) 
= Cov (x1.3, ¥2.3) 
"123 = 


(s.d. of x}.3)(s.d. of x 7.3) 


21S eA 5 erg 
(l-r4)(l-74) = Vn 22 


Remark. If 112.3 = 0, then F\2 = 113 193. 
.. Ifx3 is correlated with x), x. both ie., 3 #0, 13 #0, then 
ry2 # 0 
= XX}, X> are not uncorrelated. 
: X\, x2 are correlated even though they are uncorrelated after the effect of x; is 
eliminated. 
This is because x), x3 carry the effect of x3; on them. 
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Ex. 14-5. Show that 


193-1 
(i) 3. = aS 21/31 ; 
yUl-73)) 1-75} 

43-Nor 
Geta A 12 2 
yl=72)-75 
Sol. (i) We have i 
0 =} 
= } 
= } 
= } 
> bo3.. = 
= 7 
Similarly 539.1 ee a 
(623.1) (b32.1) =r 

ae 
23.1 7 
a 
23. = ~ 
4 
Now 23 = | 
2 = 1 
13.) = 7 
\ 


Similarly prove (ii) 
Ex. 14-6. /n a trivariate distrit 


find 


Sol. 


0) = 3 ¢ 

ry? >= 0:7 ? 

123-1, l 
We have oO = 
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ne 


Ex. 14-5. Show that 


; = 23 12173) 
QQ raa = - 5 
(1~731)(1~73}) 
mination of linear effect of x3. (ii) s A3—-N2"32 


the simple correlation coefficient 


113.2 ; 
Va-73)a-3) 


Sol. (7) We have 
0 = £r.43%34 
= X(x2 — by1.3 x1 — B23. X3) X34 
7 2x2 3.1 — ba1.3 xx, X3-4 — 93.1 Xx3 X3.1 
= Exp. X31 — bp3.) Dx3.17 


2X7. X34 = Cov(x71,%3,) 


{- X13 = X23 =0} 


92 
X13 —~ X3 
03 


= by3.4 = 


4+ 2 2 
X3X2 —-193 = X1X3 2x3) 03 
: = 921 
, = 133-1 6 
| ‘ oy 
Similarly b32.4 = 134 oe 


22 
Cov Spm) — Ts 03 en es) (623.1) (b321) = rf, 


Liste te eee [- S222] 22 2m) 
13.123 03 334 = 
03 53 22 S52 033 


23.) = dana (6234, 632.1, both are with — ve sign) 


Now 23 = 


1 132 
= (12-132) (CS ny = 1) 


22 = 1~ 43,033=1- 73 


32 — 731712 
3.) = oOo 


3) Similarly prove (ii) 
ae Ex. 14-6. /n a trivariate distribution 
— 0; = 3 Oo =4, 03=5 
¥11 22 2 =0-7 r)3 = 0-61, r23 = 0-4 
find 123.415 bi23 and 01.23. 
3 #0, then 1 0-7 0-61 
Sol. We have @ = | 0-7 1 0-4) = 0-32 
correlated after the effect of x3 is | 0-61 0-4 41 


| 1 0-4 


0-4 1 - oe 
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033 


O23 = 


01.23 


Ex. 14-7. Show that1—R 


Deduce that Rj.23 2112 
and 1 + 2rj2 r3 13 27127 
Sol. We have 


2 
1-Ri23 
2 
1—773.2 
2 
1— Ry-23 
2 
L430 
2 
=> (1- R723) 
Now 0 
=> 0<1 


ed 
2 
= Ri23 
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1 0-61 
. ie 1 - wee 
t 0-7 
7 he 1 - 4 
CO 
le ve er 
0-7 0-4 
7 sh 1 |=- os 
_ -O7300 —:027 
~ [59033 (0-63)(0-51) 
= -0-05 
6; @12 3 0-46 
eT Sy iy, OBA 
= 0:54 
o 0-32 
= 6 Dir 0-84 1:85 


(23 = (1 zs ni) (1 cs 732) 


+r13+ 93°. 


O11 
O73 
11 33 
® 11® 33 - 073 
011% 33 
1 


© ©1033 {(1-r8) (1-72) - Gam)? 


1 ee ee 
= {I-73 -n3 —7i+2rsirrs2} 
©1133 
wo 


11 33 


2 
= 33 = 1-72 


AY) 


(2) 
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=> Ry23 «= 
Also (1) and (2) = 

B20) Dina ed _ 

1 ng —793 — 15 + 20: 

= 14 @ryarj3r23 2 nd +h 
Ex. 14-8. Show that three re 


1 m2 3 
ry 1 m3) = 
3) 132 1 


Sol. The equation of three | 


xj x2 : 
TT OH] +— M12 +- 
oO; 02 ¢ 
x x 
7) + oy +- 
OF 04 ( 
x x ; 
+ 03, + oy + 
Oo] oe) 
Planes (1) and (2) will coinc 
OIE, x 
®21 
and planes (2) and (3) will coinc 
ODI? 2. 
31 
First two ratios in (4) => 
11 M22 = 
ce, (trB) (1-28) = 
Le, 1=r, —n —3 + ri: 
Toon. 73 
=> my EA a= 
B12 1 


Similarly other ratios in (4) 
Ex. 14-9. Show that 

by2.3 b3.4 34.2 = 
Sol. R.H.S. = 


Ex. 14-10. Show that the c 


opposite to that between x).3 and 
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= 1-85. 


2 
1 - ry) = (132-731) \ 


Peal +2r31r21732} (1) 
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=> Ry23 2 ry. 
Also (1) and (2) > 


1- nD “ee 4) + 27r3ir2ir32 2 0 
=> 14 2nyrisr3 > re tr, +72 
12013423 = 2 +3 1793 - 
Ex. 14-8. Show that three regression planes coincide iff 


Lote 3 
ry 1 m3] = 0 
rm, 2 1 
Sol. The equation of three planes of regression are 
x] x4 X3 
—- O1, t+ O12 +—— O13 = 0 (1) 
9) a) 93 
gi ee ellos os i a 
02, +—~ O22 +—— 23 =0 (2) 
O| om) 03 
coal ae HS. 
03) +—~ 032 +—~ 033 =0 (3) 
oO) 02 03 
Planes (1) and (2) will coincide iff 
ile 22 Pi, 2 Og (4) 
® 24 227 @23 . 
and planes (2) and (3) will coincide iff 
21 = ® 22 a ® 23 : (5) 
® 3] 032 O32 


First two ratios in (4) => 
@11 O22 = O21 My2 


Bd oD a 2 
ie, (1-733) (1-13) = (132713 — 112) 
be., L = 193-3 +093 73 = 13303 +112 —2rarwris 
Les 1133 — 1g —y + 2rvarsari3 = 0 
Le Ag HS 
= hy Ty} = 0, 
ey ae 


Similarly other ratios in (4) and (5) also imply this condition. 
Ex. 14-9. Show that 

D123 by3.1 631-2 = 11237234 731-2- 
Sol. R.H.S. = 112.3 123-4. 131-2 


7812 —% 23 ~ O31 


V¥11 22 || ¥@22 33 || ¥O33O1] 3 


- |-2 mall 62 mall 93 221 
62 O11 03 22 0) 033 
= byi2.3 b93.4 631. 


Ex. 14-10. Show that the correlation coefficient between x).73 and X.13 is equal ana 
opposite to that between x,.3 and x3.3. 


° 
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Sol. We have ; 


Cov (21.23, X2.13) = W 2X}.23 2.13 


li 


1 
W 2X1.23 (X2 — ba1.3 x — bo3.4 x3) 


ll 


1 
— 13 N 2X1.23 X4 


1 
— by13 W Dxy.237 


2 2 
= — by1.3 94.23 
Cov (1.23, %2.13) 


7(X1.23, X2.413) = Glos oa 


| 

| 
o 

Nv 

ma 

rot) 


It 
| 
-—— 
Q 
N 
S 
~ 
us 
Q 
— 
S 
~ 
=) 
S 


= —721.3 
= — 71X13, X23). 
Ex. 14-11. Show that if x; = ax, + bx, the three partial correlations are numerically 
a 
equal to unity, 73.2 having the sign of a, r3.1, the sign of b and r\.3 the opposite sign of EB 


Sol. Here x), x2 can be regarded as independent and x; is dependent on both of them 
: ry2=0 = Cov (x;,%»2)=0 
=> Xx1x. =0 
Now Var (x3) = Var (ax, + bx2) 
, 03° = a’ Var (x1) + BD? Var (x2) 
= & 62+ oy 


1 
Cov (x1,%3) = N 2X) X3 
1 2 
= — Xx (ax, + bx) = ao, 
N 
aot O| 
Rig ea Ge 
9193 93 
oc 92 
Similarly 3 = b ae 
3 
1372 %2 
P'13-2 9 , 
(1-73)(1-74) 
ag, /oz3 ag| 


|, 2703 eg be o3 
\ we 
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eee 


Similarly 3.) = 


Now 23 = 


according as ab > or <Oi.e., aan 


a 
i.e, ae 
b 
r\2.3 has sign opposite o 
Ex. 14-12, Lf ry3 = 1, show th 
7) 
Rin3 = 
and oto; = 


Sol. We have RP; (I Sy 733) 
Put 2305 
(r\2- m3)" 


Rip; (1- 133) 


2 2n, 
> Ri23 > = ] 
23 =1 


ll 


ll 


1+7193 


By Ex. 14-3, 
2 
01.23 = ¢ 


Ex. 14-13. Show that R?>; =. 


Sol. R.H.S. = £ 
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~ bo3.1 X3) 


0/11 


/M9 


rtial correlations are numerically 
a 
1and 12.3 the opposite sign ir 


x3 is dependent on both of them 


) 

O01 

‘1 

4? 03 


1/2 
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ao| = ao) 

la” of +b? 05 -b? of ([aloy 
= + | according as a> or <0. 
Similarly 13.1 = + 1 according as b> or <0. 


2 773123 


Now 123 = (1-3) (1-3) 


acy bo 
03 93 
a’ of b? 63 
1 - 2 1- 2 
03 03 
— abo, 
y(o3 -a7 of) (03 -b? 3) 
-ab _ 
= —=7] 
|ab| 


according as ab > or < 0 i.e., a and b are of same or opposite signs 


; a 
ie, — >or<0 
b 


. " a 
’12.3 has sign opposite of —. 
Ex. 14-12. /fr3 = 1, show that 


2 2 
Riz3 = 12 = 13 
2 2 
and Ohos Oj] (1-73) 
Sol. We have RPo3 (1-7) al nt) + ni — 2712 713 123 
Put 23 = 1 


2 
(re-n3) = 0 > rp=ry 


2 
Riz (1-733) = 2r2 (1-13) 


2r?, 
= R23 Fl isp pees Pease 
23 Iry3 =I 
By Ex. 14-3, 
2 2 2 2 
Giz3 = Of (1-Rizs) = of (1-3). 
Oo 
Ex. 14-13. Show that RPy3 = by 3% “2 + Bix 743 2 
oO} 0} 
Oo oO 
Sol. R.H.S. = 45.372 2 4 513.2 743 —3 
oO} Oo} 


62 1] Oo} 


l 
— —— {M2712 + My3r13} 
O71] 


- cal en, an - o eu] 58 
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and 
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——a> {riz (131 23 — ra) + r13(r21732 — 731)} 
1-753 


as ara: 
= , {n> +73 —272 73 rs} 
1-153 


= R2 
~ 1-23: 
Ex. 14-14. Show that R\.23=0 = noa=ry3=0. 


Sol. We have 1= Rpys 7 (1-73)(I-32) 


(See Ex. 14-7) 


Put Reo = 0 
2 2 
(1-73) (I-32) = 1 (1) 
SinceO Sry" S1 and 0573, <1, 
(1) is possible only when 
1-7 = 1 > n2=0 | (2) 
1-73, = 1 > n32=0 (3) 
(3) => ris hietsy =O (see Ex. 14-5) 
=> ri3 = O (using 2). 
EXERCISES 
1. Find the regression equation of x3 on x; and x2 given that 
ri2> 0:28 123 > 0-49 3, = 0-51 
0; =2:7 0, =2:4 63 =2:7 


Also find R123, 13.4. 


. Calculate the multiple correlation coefficient of x; on x2 and x3; from the following 
data : 
xy 1 2 3 4 5 8 
X2 2 2 4 2 2 4 
X3 13 . 15 21 17 21 32 


Also find the regression equation of x; on x9, x3. 


. Let x; =seed-hay crop, x2 = rainfall and x; = accumulated temperature. The following 
means, s.ds. and correlations are found 
X, = 28-02, X27 =4-9, X3 =594 
o,=4-4 02 = 1-1, 03 = 85 
X12 = 0-8, ry3 =— 0-4, r3 =— 0-56. 


Find all partial correlations and the regression equations for hay-crop on rainfall and 
accumulated temperature. 


. Let x), x2, x3 are variates with zero means that 
o, =1, O = 1:33, 63=1-9 
ri2> 37, r}3 =— 0-641, r3 = — 0-736. 
Verify that 113.2 = 143-2 where x4 =X, +X. 


~ If x) = V1 +2, X2 =Y2 + y3, X3 = 3 +) where yj, y2, y3 are uncorrelated variables each 


1 
of which has zero mean and unit standard diviation, find R.23. Ans. B 


» Ifry2 =r13 =123 = p (#— 1), show that each partial correlation coefficient is l+p and 


each multiple correlation coefficient is 
pv2 
Vyl+p 


7. 


10. 


11. 


12. 


13. 


14, 
15. 


16. 
17. 
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Also show that 1 - Roy =) 
1, X2, X3 are uncorrelated var 
Let - yp =e 


show that y;, v2, y3 are standa 


. If AX, + AQX2 + A3X3 = k, pro’ 


_ | 


i 


with two similar expressions 
Also show that all the partial q 


. If x1, x2, x3 are three variates 


expected value of x, for given 
and x3, prove that 

_ Cov (%,e1:) = 3 
If x), x2, x3 are standard varia 


E(x2x3) =] 

| 

If 193. ( 
Die, 

Rigg = 1 

and o? 3 = } 

Suppose a computer has foun 

Y2> 0-6, tr 


Examine whether his comput 

(Hint. Find 112.3). 

Comment on the consistancy 
r17 = 0-6, r 

For what value of Rj.23 will x 

If 7\2 and r;3 are given, show 


ryari3 

(Hint. Use Piaa’ < 1). 
If 12 =k; 3 =—k, show that 
A number of persons are mea 
and product moment correlat 
P20 


—_ x x 
Hint. Use Ej} +4 4 
Oo; 932 


Show that bi23 cecal 
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-ro1) + 3121732 — rai)} 


3 rs} 
(See Ex. 14-7) 
(1) 
.(2) 
. (3) 
(see Ex. 14-5) 
5 
ven that 
131 > 0-51 
03> 27 


‘x, on x2 and x3 from the following 


4 5 8 
2 2 4 
17 21 32 


amulated temperature. The following 


¥; =594 
63 = 85 
123 = 0-56. 


:quations for hay-crop on rainfall and 


o3;=19 
3 = — 0-736. 
where X4 = X1 + X2- 
» V2, y3 are uncorrelated variables each 


1 
Ans. —= 
3 
p 


tial correlation coefficient is +p and 


ation, find Rj.23. 


11. 


12. 


13. 


14. 
15. 


16. 
17. 


18. 
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10. 


563 


oy ep) Mepy 


Also show that 1 — Riss 140 


. X}, X9, X3 are uncorrelated variates with same variance. 


X1—-X3 Yo _ *1+%2 +X3 y3 _ 4 4+2x2 +23 
V2" 3” V6 


show that y;, v2, y3 are standard variates. Also find rj2.3 and Ry.23 for y’s. 


Let v= 


. If ax, + aoX2 + a3x3 =k, prove that 


(<3 03 -a? 0? -a3 03) 
= 


2a 147292 
with two similar expressions for r,3-and 73. 
Also show that all the partial quotients are equal to — 1 provided that a’s are all positive. 


. Ifx1, x5, x3 are three variates measured from their respective means and if e; is the 


expected value of x, for given values of x2 and x; from the linear regression of x; on x2 
and x3, prove that 

Cov (x1, e1) = Var (e)) = Var (1) — Var (x1 — e}). 
If x), x2, x3 are standard variates and 


1 
E(x2x3) = E(x\x3) = > Show that 
E(x\x2) >— 5/2. 
If 123 = 0, show that 
2 2 
Rfo3 = 12 +3 
2 2 
and 0793 = 1-2 - 73. 
Suppose a computer has found, for a given set of values of x;, x2 and x3 
ry2> 0-6, 123 = 0-7, 3,-= = 0-4. 


Examine whether his computations may be said to be free from error. 
(Hint. Find r}2.3). 
Comment on the consistancy of 
r}2 = 0-6, r3 = 08, r3, =— 0-5. 
For what value of R}.23 will x2 and x3 be uncorrelated ? 
If r}2 and r;3 are given, show that 723 must lie in the range 


le 


(Ans. No) 


rari3 + (1-73 —13 +112 N13 
(Hint. Use r}.3° < 1). 
If ro =k; 123 =—k, show that r;3 will lie between — 1 and 1 — 2k’. 
A number of persons are measured for heights x,, weights x, and chest expansions 13 
and product moment correlation co-efficients are calculated. Show that 
r\2 ar 123 ot r3,2 — 3/2. 


2 
{ x x x 
Hint. Use £E|—) + 24+] 20 
07} 02 03 


by — 543532 
bi3 = : 


Show that 
1—b73b3p 


QO 
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Sampling Theory and Large Sample Tests 


15.1. Introduction 


Very often in practice one is interested in drawing valid conclusions about a large 
group of individuals or objects. Instead of examining the entire group (which may be difficult 
or impossible) one may think of examining a small part of it. This is done with the aim of 
inferring certain facts about the large group from the results found for smaller part. This 
process is called statistical inference. Various technical words used during this process are 
explained as below : 

Population. Any collection of individuals or of attributes or the results of operations 
which can be specified numerically. 

Finite Population. Population containing finite number of members. Otherwise the 
population is called infinite population e.g., the population of boys in a college and 
the population of pressures at various points in the atmosphere are finite and infinite 
respectively. 

Existent Population. Population of concrete objects e.g., the population of a city. 

Hypothetical Population. Population of non-concrete objects e.g., the population of 
heads and tails obtained by tossing a coin an infinite number of times. 

Sample. 4 part or small section selected from the population is called a sample and 
the process of such selection is called sampling. 

Random Sampling. When a sample is taken in such a way that each member of the 
population has the same chance of being selected, the sample obtained is called random 
sample and the technique is called random sampling. 

Simple Sampling. When a random sample is drawn from a population in such a way 
that the chance of selection of a member at any stage is independent of previous selections, 


the sample obtained is called simple sample and the technique is called simple sampling. 


Stratified Sampling. Jn this process the entire heterogeneous population is divided 
into a number of homogeneous groups (termed as strata) which differ from one another but 
each of these is homogeneous within itself. The samples are drawn from each stratum (the 
sample size in each stratum varying according to the relative importance of the stratum in 
the population). The aggregate of the samples from each of the stratum is called stratified 
sample and the technique is called stratified sampling e.g. To estimate the average income 
of the inhabitants of a city, it is necessary that all sections of the society must be included in 


the sample otherwise there is a likelihood that more rich people or poor people many. be — 


dominating the sample. For this purpose, it is better to divide the city into different strata 
say, according to the localities, slums, middle-class localities and bungalow areas and then 
to draw samples from each of these localities. This would ensure that all sections of the 
society are represented in the sample. 
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Sampling with or without Replacement 


Sampling where each number of a population may be chosen more than once is called 
sampling with replacement and if each member cannot be chosen more than once, it is 
called sampling without replacement. 

Remark. (/) From a finite population, a sample with replacement of any size can be 
drawn without exhausting the population. 

(ii) For most practical purposes sampling from a finite population (which is very large) 
can be considered as sampling from an infinite population. 


Parameters 


A population is considered to be known, if the probability function (or density function) 
J{x) of the associated variable x is known e.g., ifx is normally distributed, the population is 
said to be normal. aes 

Certain quantities may appear in f(x) (e.g., m and © in case of normal distribution). 
Other quantities such as mean, variance etc., can then be obtained in terms of these. Such 
quantities are called population parameters or simply parameters. 

Remark. When the population is given, the population parameters are taken:to be 
known. 

Statistic. It is a statistical measure computed from sample observations alone. 

Remark. Statistic is calculated with the purpose of estimating a population parameter. 
To each population parameter there is a statistic to be computed from the sample. This 
statistic may not always give the best estimate. Once of the important problems of sampling 
theory is to decide how to form a proper sample statistic, so as to get a best estimate of a 
given population parameter. 


Sampling Distribution : 
The statistic is itself a random variate. Its probability distribution is often called sampling 
distribution. It can be thought of as below : 


All possible samples of given size are taken from the population and for each sample 
the statistic is calculated. The values of the statistic form its sampling distribution. 


Standard Errors. The standard deviation of a sampling distribution of a statistic is 
known as standard error and is written as ‘S.E.’. 


Precision. The reciprocal of S.E. is called precision. 
Probable Error (P.E.). /t is defined by 
PE. = (0-67449) S.E. 


Standard Errors of Various Parameters 


o 
] irti 136263 ==: 
(4) Quartiles ia 
o 
ti 7 1-25331 ==: 
(ii) Median Jn 
o 
(iii) SD. . J2n ‘ 
2 
(iv) Variance 3? 2 : 
(l-r? 


(v) Co-efficient of correlation 
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(vi) bs o {S- 
n 


Unbiased Estimate. A statistic ‘t’ is said to be an unbiased estimate of a parameter 
if E(t) =9. 
Asymptotically Unbiased Estimate. A statistic ‘t,,’ is said to be an asymptotically 
unbiased estimate of a parameter 0 if 
Lt E(t,) = 


no 
where n is the size of the sample. 

Large and Small Samples. Samples of size greater than 30 are called large samples 
and of size less than or equal to 30 are called small samples. 

Hypothesis. Very often it is required to make decisions about populations on the basis 
of sample information. Such decisions are called Statistical decisions. In attempting to 
reach decisions, it is often necessary to make assumptions about the population involved. 
Such assumptions, which are not necessarily true, are called statistical hypothesis. 

Null Hypothesis. The hypothesis tested for possible rejection under the assumption 
that it is true is usually called null hypothesis. 

Tests of Significance. Procedures which enable us to decide, on the basis of sample 
information, whether to accept or reject hypothesis or to determine whether observed 
sampling results differ significantly from expected results are called tests of significance, 
rules of decision or tests of hypothesis. 

Level of significance. The probability level below which we reject the hypothesis is 
called the level of significance. 


Confidence Interval 


It is the interval in which a population parameter is expected to lie with certain probability 
(mentioned in percentage). 


The end numbers are called confidence limits or fiducial limits. The probability is 


called confidence level. 


15.2. Sampling of Attributes 

In the case of sampling of attributes we are concerned only with the presence or absence 
of some given attribute. The selection of an individual in sampling may be called a trial and 
the presence of a specified attribute a success and its absence a failure. 

By simple sampling of attributes we mean random sampling in which each trial has the 
same chance of success and in which the chances of success of different trials are independent 
whether the previous trials have been made or not. 


Mean and s.d. 
Suppose we are to draw a simple sample of n individuals from a population. Let p be 
the chance of success and q the chance of failure of each trial. 
Then ptq=i1 
The drawing of a sample is identical with the problem of a series of 1 independent trials 
with constant probability p of success. 
. The probability of 0, 1, 2, . 
ee of (¢ +p)’. 
*, The probability of x successes is given by 
P(x) = "c,p* q"~*, x =0, 1, 2,. 
The binomial probability distribution so obtained is called ‘aauie distribution of the 
number of successes in the sample. ; 


, n successes are the successive terms in the binomial 
(from B.D.) 
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.. The expected value or mean value of the number of successes 


ie. E(x) = np 
and the s.d. of the number of successes 


= Jnpq. 


: x 
Now proportion of successes = — 
n 


x 1 
(=) = —.EQ@)=p 
n 


n 
x 1 
and Var (=) = 5 Var(x) = it 
n n n 
*. Standard deviation of ccna pe ; 
n n 


15.2.1. To test the significance of single proportion for large samples 


Let us suppose that w.rt. attribute A, it is possible to classify individuals of a population 
into two mutually exclusive and collectively exhaustive sets. Let x be the number of 
individuals possessing 4 in a single sample of size n. Then the proportion of individuals 
possessing A is given by 


p= 


Let P be the probability for an individual to possess A i.e., P is the probability of 
success. 


Then x is a binomial variate with expected value nP and s.d. ./nPO.(where Q= 1~P) 


= [se 


pase x-nP__ p-P 
ynPQ [PO 
n 


is a binomial variate with mean zero and s.d. unity. Since sample is large u is nearly a N (0, 1). 
. P(ul>3) = 1-p(-3<u<3)=1-2 P(0<u<3) 

1 — 2(0-49865) = 1 — 0:9973 = 0-0027 

P(\u| > 1-96) = 0-05 

p(\u| > 2:58) = 0-01. 

The probability P is obtained by setting nult hypothesis. On the basis of probabilities 
obtained above the rules for taking decisions are : 

(i) If |u| > 3, the difference between the observed and expected number of successes 
is highly significant'‘and hence the hypothesis is certainly wrong and is to be rejected. 
(ii) If 2-58 < |u| <3, the difference is significant at 1% level of significance. 

(iii) If 1-96 < |u| < 2-58, the difference is significant at 5% level of significance. 

(iv) If |u| < 1-96, the difference is not significant and the data is said to be consistent 
with the hypothesis and hence the hypothesis may be accepted. We can also say that 
divergence may be due to flactuations of sampling — 

Note. (i) The above test is valid only for large samples since for small samples binomial 
distribution may not be nearly normal. 

(ii) The test may furnish evidence against the hypothesis but it cannot prove the 
hypothesis to be correct. It can at the most provide no sie against it. 

(ii?) Since the hypothesis can be rejected but cannot be proved, always null hypothesis 
is set, e.g., to test whether there is any difference, it is assumed that there is no difference : 


Similarly 
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to test whether there is any relationship, it is assumed that there is no relationship etc. The 
rejection of no difference will mean a difference and the rejection of no relationship a 


Set the hypothesis : ‘Die is unb 
relationship. 


Ex. 15-1. A coin is tossed 400 times and it turns up head 216 times. Discuss whether ana P= ; 
the coin may be regarded as unbiased one. 
Sol. Let x be the number of heads obtained and P the prob. of getting head in a toss. 2 
Set the hypothesis : ‘Coin is unbiased’. “uz e- 
1 

Then P= > 
Here n = 400, and x=216 - Difference is highly signific 
1 .. The die cannot be regarded ; 

216-~400.— 

x—nP 1 
u= = = 1-6< 1-96 P#- 
JnPQ Ad 400 ee: 
227° In order to find the limits of F 


successes from the sample. 


.. The hypothesis may be correct and hence the coin may be regarded as unbiased. : 
Now proportion of successes 


Ex. 15-2. Jn some dice-throwing experiment, Weldon threw dice 49,152 times and of 


these 25,145 yielded a, 4, 5 or 6. Is this consistent with the hypothesis that the dice were ae) 
unbiased ? ~ 9 
Sol. Set the hypothesis ‘Dice was unbiased’ .. Estimate of the standard errc 
3 1 oi] 
Then P = prob. of 4, 5 or 6 ina throw = => = \ 
Here n = 49,152 and x = 25,145 ’. Probable limits of P are give 
25145 991325 a Pee X_p 
a = 5: : 
110-85 7 iB i.e. ~ —3(0- 
49152 5-5 0-005 <3i.e., - 3(0 

& Hypothesis is wrong and hence the dice could not be regarded as unbiased. 

Ex. 15-3. Certain crosses of the pea gave 5,321 yellow and 1,804 green seeds. The -. Probable limits of Pare 
expectation is 25% green seeds on a Mendelian hypothesis. Is the divergence significant or 0:36 + 3(0-005) i.e., 0-34! 
might have occurred as due to fluctuations of simple sampling ? Ex. 15-5. Jn a locality of 18,00 

Sol. Total number of seeds (n) = 5321 + 1804 = 7125 these 840 families, 206 families were. 

Here P = expected proportions of green seeds desired to estimate how many out of t 

25 1 ‘ | less. Within what limits would you ph 

; 7 = 100 = 4 Sol. Let p be the proportion of { 

*. The standard error of green seeds Then estimate of p from the sm 
3 qe | | _ 2 

= Te re = 36-6 84 


= . . Estimate of the standard error « 
1804 ~—-7125 : 

u = ——4_—_ = 06 << 1-96 fi 

vc 36:6 . : = val 

.”.. The data is consistent with the hypothesis and hence the divergence may be regarded ~ | 
as due to fluctuations of simple sampling. 

Ex. 15-4. A die is thrown 9,000 times and a throw of 3 or 4 is reckoned as a success. 
Suppose that 3,240 throws of a3 or 4 have been made out. Do the data indicate an unbiased _- 
die ? If not, find the probable limits of prob. of getting 3 or 4. 

Sol. Here. - . n = 9,000 - 

FG x = 3,240 


.. Probable limits of p are 
0:245 + 3(0-015) Ze, 0-20 
The probable limits of the nu 
and 5220. 
Ex. 15-6. A sample of 900 day. 
district and 100 of them are found to be 
of foggy days in the district? 
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Set the hypothesis : ‘Die is unbiased’. 
2 


Th P >.> 
en G 


me 

3 
3240-9000.4 

pS Be ey 


9000-42 
3 3 


.. Difference is highly significant and hence the hypothesis is wrong. 
.. The die cannot be regarded as unbiased. 


Pas 
3 


In order to find the limits of P, we estimate the standard error of the proportion of 
successes from the sample. 


Now proportion of successes 


.. Estimate of the standard error of the proportion of successes 


2 [(0:36) (-0-36) _ [(0-36)(0-64) or 
9000 9000 ne 


’. Probable limits of P are given by 


Pap : ot eae 
A <3 i.e. — —3(0-005) < P< — + 3(0-005 
0-005 n n 


*, Probable limits of P are 
0:36 + 3(0-005) i.e, 0-345 and 0:375. 

Ex. 15-5. /n a locality of 18,000 families a sample of 840 families was selected. Of 
these 840 families, 206 families were found to have a monthly income of Rs. 50 or less. It is 
desired to estimate how many out of the 18,000 families have a monthly income of Rs. 50 or 
less. Within what limits would you place your estimate ? 

Sol. Let p be the proportion of families with income Rs. 50 or less in the locality. 

Then estimate of p from the sample 
206 103 
g40 420 4 
. . Estimate of the standard error of the proportion of families with income Rs. 50 or less 


oo a59) a ~ 0.015 
~ ¥420 420) 840 ~~ 


.. Probable limits of p are 
0-245 + 3(0-015) i.e, 0-20 and 0-229 
The probable limits of the number of families with income Rs. 50 or less are 3600 
and 5220. , 
Ex. 15-6. A sample of 900 days is taken from meteorological records of a certain 
district and 100 of them are found to be foggy. What are the probable limits to the percentage 
of foggy days in the district? 
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Sol. Proportion of foggy days in the district (estimated from the sample) 


900 9 


Estimate of the standard error of the proportion of foggy days in the district 


awtl-3) = 
500 (1-9 } = 9-0105 


Probable limits to the percentage of foggy days are 


100 {0-1111 + 3(0-0105)} i.e, 796% and 14-26%. 


Ex. 15-7. A sample of 500 pineapples was taken from a large consignment and 65 were 
found to be bad. Estimate the proportion of bad pineapples in the consignment, as well as 
the standard error of the estimate. Deduce that the percentage of bad pineapples in the 


consignment almost certainly lies between 8-5 and 17-5. 


Sol. Here 


n = 500, 


p = proportion of bad pineapples in a consignment 


= —— =0-13. 


‘The standard error of the proportion of bad pineapples 


_ [e@=p) _ [0130-87 _ 4, 
= : a = 0-015 


". Probable limits to the percentage of bad pineapples are 
100 {0-13 + 3(0-015)} i.e., 8-5% and 17-5%. 


Ex. 15-8. 4 biased coin was thrown 400 times and head resulted 240 times. Find the 
standard error of the observed proportion of heads and deduce that the probability of getting. 


a head in a.throw of the coin lies almost certain between 0-53 and 0-67. 
Sol. Observed proportion of heads 


.. S.E. of the observed proportion of heads 


(0:6)(0-4) 
400 
= 00245 


". The probability of getting a head in a throw of a coin lies in 


ié., 
Leé., 


0-6 + 3(0-0245) 


0-5265 and 0-6735 
0-53 and 0-67. 
Ex. 15-9. A dealer takes 100 samples from a consignment of 1000 items of a certain 
good and finds that there are 50 items of grade I worth Rs. 5 per thousand, 30 items of grade 
Il worth Rs. 4 per thousand and 20 items of grade III worth Rs. 3 per thousand. Within what- 


limits should the value of the consignment be fixed ? 


Sol. Grade I. 


Proportion of items = 


100 


.. SE. of the proportion of items 


1 iz 
= Vigp 0? (0-5) = 0-05 
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.. Probable limits to the proport 
0-5 + 3(0-05)i.e, 0-35 a 
Grade IT 


30 
Proportion of items = —~ = 0:3 


100 

.. S.E. of the proportion 
-{ 
*. Probable limits to the propor 
0:3 + 3(0:0458) i.e, 0-16 
Grade III 
Proportion of it = 0-2 

oportion =—~ =0: 
p of items = 75 
.. S.E. of the proportion 


’, Probable limits to the propo 
0-2 + 3(0-04) 7.2, 0-08 a 
Now the highest value that can 
grade I is the highest and grade III tk 
Proportion of grade I = 0-65 
and proportion of grade II = 0-08 
, Proportion of grade II = 1 
Highest value of the consi 
= (0 
The lowest value that can be giv: 
the lowest the grade III the highest s 
’ Proportion of grade I= 0-35 
and proportion of grade III = 0-32 
: Proportion of grade II = 1 
The least value of the con 
= (0 
The limits of the value of 
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.. Probable limits to the proportion of items are 
0:5 ¥ 3(0-05) i.e, 0-35 and 0-65 
Grade IT 


30 
P i i Se en: 
roportion of items 100 0-3 


.. S.E. of the proportion 


_ (@307 _, 
= [C22 - ooass 


‘. Probable limits to the proportion are 
0-3 + 3(0-0458) i.e. 0-1626 and 0-4374. 
Grade IIT 


20 
P i i =—— =(). 
roportion of items 100 0-2 
.. S.E. of the proportion 


1 
= V00 (0-2)(0-8) = 0-04 


’. Probable limits to the proportion are 
0-2 ¥ 3(0-04) i.é., 0-08 and.-0-32. 
Now the highest value that can be given to the consignment is that value for which 
grade I is the highest and grade III the lowest so that 
Proportion of grade I = 0-65 
and proportion of grade III = 0-08 
Proportion of grade II = 1 — 0-65 — 0-08 = 0-27 
Highest value of the consignment , 
= (0-65) (5) + (0-27) (4) + (0-08) (3) = Rs. 4-57. 
The lowest value that can be given to the consignment is that value for which grade I is 
the lowest the grade III the highest so that 
’ Proportion of grade I= 0-35 
and proportion of grade III = 0-32 . 
; Proportion of grade II = 1 — 0-35 — 0:32 = 0:33 
The least value of the consignment 
= (0-35) (5) + (0-33) (4) + (0:32)3 = Rs. 4-03. 
The limits of the value of the consignment are Rs. 4-03 and Rs. 4-57. 


EXERCISES 


1. Acoin is tossed 10,000 times and it turns up head 5195 times. Is it reasonable to think 
that the coin is unbiased ? [Ans. No] 

2. In324 throws ofa six-faced die odd points appeared 181 times. Can the die be regarded 
as unbiased ? [Ans, Insignificant at 1% level] 

3. In breeding certain stocks, 408 hairy and 126 glabrous plant were obtained. If the 
expectation is one-fourth glabrous, is the divergence significant or might it have 
occurred as a fluctuation of sampling ? [Ans. Insignificant] 

. Experience has shown that 10% of a manufactured product is of top quality. In one 
day’s production of 400 articles only 50 are of top quality. Does this contradict our 
hypothesis of 10 percent? {Ans. No] 

. Balls are drawn from a bag containing equal number of black and white balls with 
replacement. In 2000 drawings, 1100 black and 900 white balls appear. Is there some 
bias in the drawer ? [Ans. Yes] 
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6. A personnel manager claims that 80% of all single woman hired for secretarial job get 
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married and quit work within two years after they are hired. Test this hypothesis at 5% 


level of significance, if among 200 such secretaries 112 got married within two years 


after they were hired and quit their jobs. [Ans. Hypothesis is wrong] 


- 400 apples are taken from a large consignment and 50 are found to be bad. Estimate 


the percentage of bad apples in the consignment and assign the limits within which 
the percentage lies. 


. Given that, on the average 40 out of 1000 insured men of age 60 die within a year and 


that 60 of a particular group of 1000 such men died within a year, show that this 


group cannot be regarded as representative sample, seeing that the actual deviation of 


the proportion of deaths is more than three times the standard error of the proportion 
for samples of the size. 


- Aman buys 100 sacks of tonatoes. He finds that out of 100 tomatoes chosen from the 


10. 


11. 


12. 


13 


14. 


15. 


sacks at random, 40 are of type A, worth Rs. 10 a sack, 25 are of type B, worth Rs. 7 
a sack, 20 are of type C, worth Rs. 5 a sack and 15 are of class D, worth Rs. 4 per sack. 
What are the upper and lower limits for the value of the tomatoes ? 

[Ans, 634-78 and 835-21] 
12 dice were thrown 6500 times; 4, 5 or 6 being reckoned as a success. What proportion 
of success do you expect ? If in actual observation the proportion of success is found 
to be 0-5016, find the standard deviation of proportion with the given number of 
throws and state whether you would regard the excess of successes as probably 
significant bias in the dice. ‘ 
In a certain maternity home during a year there were 1600 births of which 840 were 
males. Test the hypothesis that male and female births are equally likely. Supposing 
the null hypothesis is not given, determine the + 30 confidence limits for the proportion 
of male births. 
A biased coin was thrown 400 times and heads resulted 240 times. Show that the 
probability of throwing head ina single trial almost certainly lies between 0-53 and 0-67. 
Ina sample of 500 people in Kerala 280 are tea drinkers and the rest are coffee drinkers. 


Can we assume that both coffee and tea are equally popular in this state at 5% level of 


significance ? [Ans. Yes] 


[Ans. 7:5% and 17:5%]° 


A random sample of 16 values from a normal! population showed a mean of 41-5 —~_| 


inches and a sum of squares of deviation from this mean equal to 135 (inch)*. Show 


that the assumption of a mean of 43-5,inches for the population is not reasonable and 


that the 95% confidence limits for this mean are 39-9 and 43-1 inches. 
Show that the prob. that the number of heads in 400 throws of a fair coin lies between 
180 and 220 is approximately 2F(2) — 1 where F(x) denotes the standard normal 
distribution function i.e., . 

; F(x) = P(X Sx), X~N(0, 1) 


15.2-2. Comparison of Large Samples 


Let the two populations be tested for the prevalence of a certain attribute A by taking 


trom them large simple samples of sizes n, and n2 respectively. Let x, and x2 be the number 
of individuals possessing 4 in the two samples. 


ad x2 
Let Pi = — and po= 
Nn 


1 2 
Let P; and P2 be probabilities for an individual to possess A for two populations. 
Then E(x\) = ,P; and hence E(p;) = P; 


: 7 _ AP a 
Var (x1) = mP\Qy and hence var A aseerse 0; =1 —P, 
1 
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Similarly, E(p2) = 
Now E(p, ~— p2) = 
and var (pj — p2) = 
u = 


is approximately a M(0, 1). 
(7) The hypothesis to be teste. 
‘Is the difference (p; — p2) sig 
wit. A’. 
To proceed with we set the hy 
basis of this hypothesis 
P, = 


where Q = 1—P 

we now test the significance with t 
Thus if (i) |u| < 1-96, the hypo 
(if) 1-96 < |u| < 2-58, the diffe 
(iii) 2-58-< |u| <3, the differe: 
(iv) |u| > 3, the hypothesis is not 
Generally P is unknown so w 

unbiased estimate of P is given by 


P=: 


It is unbiased because 


E(P) 


=] 
(ii) The hypothesis to be teste: 
‘Is the real difference between t 
i.e., ifin populations Py < Py, is it 


Lé., Pi-P2 = ( 
i.é., (P, — Pz) + eu < ( 
Le, . us - 
where e= 1 

Now P(u> 1-645) = € 
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to possess A for two populations. 


B(pi) = Pi 
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so (uaa yl P 
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P; 
Similarly, E(p2) = P and var (pr) = 22. 9, =1—P, 
Ny 
Now E(p1 — p2) = E(p\) — E(p2) = Pi — P2 
, P, P. 
and var (p) —p2) = var (p1) + var (p2) = AQ + Poo 
“AY n2 
Ae (P| — P2)-(R - Py) 
AQ, , P:Or 


ny Ny 
is approximately a M(0, 1). 
(i) The hypothesis to be tested is : 


‘Is the difference (p; — p2) significant of a real difference between the two populations 
wit A’. 


To proceed with we set the hypothesis that two populations are similar wrt, A. On the 
basis of this hypothesis 


P, P, = P (say) 


Pi — P2 


where O= 1-—P 
we now test the significance with the aid of the normal curve. 
Thus if (7) |u| < 1-96, the hypothesis is acceptable at 5% level of significance. 
(ii) 1-96 <|u|< 2-58, the difference is significant at 5% level of significance. 
(iii) 2-58 <u| <3, the difference is significant at 1% level of significance. 
(iv) |u|>3, the hypothesis is not acceptable and hence the difference is highly significant. 


Generally P is unknown so we have to estimate it from the sample proportions. An 
unbiased estimate of P is given by aaa 


pve MP *N2P2 _ Xi +2 
ny +n ny +n 
It is unbiased because 
E(P) _ E(x; +x2) _‘mA +n2P) 
ny +N mtn 
=P (. Py =P. =P) 


(ii) The hypothesis to be tested is : 
‘Is the real difference between the populations likely to be hidden in two samples drawn’ 
i.e. ifin populations P; < P,, is it likely that p; and p2 will be s.t. p; $ po 
Le., Pi-P2 © 0 


Leé., (P, —P2)+eu < 0 


A-P; 
Le., us ae eae 2) 
e 
; — fA , BQ 
where e= my ity 
1-645 


Now P(u > 1-645) 


0-5- | dP=0-5-0-45=0-05 
0 
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P(u<-— 1-645) = 0-05 
PP. 
if orcs > 1-645 
P-P 
ie, Se) aes 
e@ 


P-P. 
Plu < -G-2)) < 0:05 


which implies that at level, it is unlikely that difference will be hidden in simple sampling. 


B- 

Similarly, if | 
significance. 

Ex. 15-10. /n two large populations there are 30% and 25% respectively of fair haired 
people. Is this' difference likely to be hidden in samples of 1200 and 900 respectively from 
the two populations ? 

Sol. Here, P; = 0-3, P2 = 0-25, n; = 1200 and nz = 900 

(0-3)(0-7) | (0-25)(0-75) 


— tH Or C= 0 1 
e 1200 900 a 


Be a O20 561.645) 
e 0-0195 ~ 
At 5% level the difference is unlikely to be hidden. 

Ex. 15-11. /n a simple sample of 600 men from a certain large city, 400 are found to be 
smokers. In one of 900 from another large city, 450 are smokers. Do the data indicate that 
the cities are significantly different with respect to the prevalence of smoking among men ? 

Sol. Set the hypothesis : Two cities do not differ significantly w.rt. the prevalence of 
smoking among men. 


> 2-327, the difference is unlikely to be hidden at 1% level of 


n, = 600, x, = 400 
nz = 900, x2 = 450 


oe 
pi = 3 and pr 
X,+X2 850 17 


p= ez ae 
mtnz 1500 30 
13 
ese a 
re renee 
mM 30 30 \600 900 
= 0-000682 
e = 0:026 / 
201 | 
“ut PUSS, Bi: 2 =6.4(>3) 
e 0-026 


’. The difference is highly significant and hence the two cities are significantly different 
with respect to the prevalence of smoking habit among men. 

Ex. 15-12. A railway company installed two sets of 50 Burmaties each. The two sets 
were treated with creosate by two different processes. After a number of years of service it 
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was found that 22 ties of first set a. 
Are we justified in claiming that the 
of the two processes ? 
Sol. Set the hypothesis : Ther 
of two processes. 
Here Pi = 
P= 
uoe 
*. Data provides no evidence 
Ey. 15-13. Ina referendum su 
566 women voted. 530 of the men 
significant difference of opinion oi 
students ? 
Sol. Set the hypothesis : The 
and women on the matter. 
Here P= 


P= 


ahs o> ; 

.. Hypothesis is wrong. 

Ex. 15-14. On the basis of t. 
examination are divided into two g. 
the first question of this examinati 
whereas among the second group § 
can one conclude that the first qu 
being examined here ? 

Sol. Set the hypothesis : The 
type being examined. 


30 
Here n; = T00 ‘200 = 60, 1. 
Pi = 


P 


uz 
The data is consistént w 
Ex. 15-15. /n a year there are' 
in towns A and B combined this pre 
significant difference in the propor 
Sol. Set the hypothesis : The 
births in the two towns. 
Here ny = 956, 


Pi = 0-525, 
Pr = 0-434 
u = 322 (>3) 


= dHiypothesies is wrong. 
Ex. 15-16. A machine puts out 
overhauled it puts out 3 imperfect ar 
Sol. Set the hypothesis : Mac 
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00 
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17 
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the two cities are significantly different 
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was found that 22 ties of first set and 18 ties of the second set were still in good condition. 
Are we justified in claiming that there is no real difference between the preserving properties 
of the two processes ? 

Sol. Set the hypothesis : There is no real difference between the preserving properties 
of two processes. 


Here pi = 0-44, po = 0-36 
P = 0-4, Q=0-6 and e=0-098 
u = 0-8 < 1:96 


‘. Data provides no evidence against the hypothesis. 

Ey. 15-13. Ina referendum submitted to the student body at a university 850 men and 
566 women voted. 530 of the men and 304 of the women voted yes. Does this indicate a 
significant difference of opinion on the matter, at the 1% level, between men and women 
students ? 

Sol. Set the hypothesis : There is no significant difference of opinion between men 
and women on the matter. 


Here Pi = 0:6235, p2=0-5371 
pee Bao. ved’ soo 
ig? = ate. 
= 3-23) 


.. Hypothesis is wrong. 

Ex. 15-14. On the basis of their total scores, the 200 candidates at a civil service 
examination are divided into two groups, the upper 30% and the remaining 70%. Consider 
the first question of this examination. Among the first group, 40 had the correct answer; 
whereas among the second group 80 had the correct answer. On the basis of these results, 
can one conclude that the first question is no good at discriminating ability of the type 
being examined here ? 

Sol. Set the hypothesis : The first question is no good at discriminating ability of the 
type being examined. 


30 
= — -200=60, n.= 140, x, =40, and x2 = 80 


Here 1 
100 
py = 0:6667 and-p, = 0-5714 


ae | 
P= 06 =0-4 d e= TF 
, Qa _ and e 5/7 


uz 1:26 (< 1-96) 
The data is consistént with the hypothesis. 

Ex. 15-15. /na year there are 956 births in a town A of which 52-5% were males, while 
in towns A and B combined this proportion in a total of 1406 births was 0-496. Is there any 
significant difference in the proportion of male births in the two wnt? 

Sol. Set the hypothesis : There is no significant difference in th proportion of male 
births in the two towns. 


Here n, = 956, ny tn = 1406 o% nz = 450 
Pi = 0525, P = 0-496 as QO = 0-504 
P2 = 0-434 ; 
u = 32 ©3) 


.. Hypothesis is wrong. 

Ex. 15-16. 4 machine puts out 16 imperfect articles in a sample of 500. After machine is 
overhauled it puts out 3 imperfect articles in a batch of 100. Has the machine been improved ? 
Sol. Set the hypothesis : Machine has not been improved. 
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H = 0.032, p.=0.03, P= 2, g=28t 
sos Py Pe 600° ~ 600 


u = 01 (< 1-96) 

.. Hypothesis may be correct. 

Ex. 15-17. Ina large city A, 20% of a random sample of 900 school boys had a certain 
slight physical defect. In another large city B 18-5% of a random sample of 1600 school 
boys had the same defect. 1s the difference between the proportions significant ? 

Sol. Here Pi = 0:2, p.=0:185, P=0:19, O=0-81 

ee = 0-92 (< 1-96) 

’, Difference is not significant. 

Ex. 15-18. (a) Two large random samples of sizes n, and nz are taken from two 
populations. If p, and pz be the proportions of members possessing the attribute in two 
samples, give procedure of testing the significance of the difference between p, and 


— MPit2P2 
ny +N 
(b) In two random samples of 400 and 500 students from two different colleges, 300 
students in each were found to be failed in an examination. Find out whether the proportion 
of failures in first college is significantly greater than the proportion of failures in two 
colleges taken together. 
Sol. (a) Let p, and p be the expected values of p; and p. Then 
— _ MPi+N2P2 
P ny +n 
where pz = E(p2) 
Now cov (pi, p) = 7 ~ P1)(p-B) 


El(p) — Py) tm (Py — Pr) +12 (P2 -— P2)}) 
ny +N4 


1 aS = = 
= fn E(p\ —D,)” +n E(p; ~ P1)(P2 - Py) 
ny +4 


n 
= — var (p1) 


ny +N 


(. cov (pi, p2) =0 as pj, p2 are independent) - 


PQ 
Now mi gives the estimate of population proportion and hence var (p;) = m and var 
(p2) = s 
a 2s 


cov ep)'= ny +n 


1 
Now var (p) = ————z_ var (mp; + n2p2) 
' (1 +12) 


= — 5 {m2 var (p,) + m3? var (2) 
(nj +n) 
Pa 
: ny +N» 
var (p~ pi) = var (p) + var (pi) - 2 cov (pi, ) 
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Assuming the hypothesis tha 
test statistic becomes 


which is a (0, 1) as 1, and np are 


(b) Here n, = 400, n. = 501 


p= 


p and p, are significantly 


1. Ina random sample of 500 
sample of 400 from town B, 2 
the data reveal a significant 
among persons is concerned 

2. Ina large city A, 20% of a rai 
In another large city B, 15-5‘ 
defect. Is the difference betv 


3. Inarandom sample of 500 m 
In one of 1000 men from anc 
the two districts are signific 
men? 

4. Fromeach of two consignmei 
of rotton eggs counted. Te 
consignments are significant! 


Sample from consignment A 
Sample from consignment B 


5. In two large populations there 
likely to be hidden in simple 
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19 581 ; mtng, Mm m+n 
600’ ~ 600 | eee Ee 
ny m+n 
Assuming the hypothesis that there is no significant difference between p; and p, the 

of 900 school boys had a certain test statistic becomes 
random sample of 1600 school . Pra 
oportions significant ? “u= ! 
0:19, Q= 0°81 iL? SED ste Md 

: ny m+Ny 


which is a (0, 1) as n; and n} are large. 
»§ ny and Nn are taken from two 


3 3 
s possessing the attribute in two eee ae 
difference between p, and : 
3008300 2 
~ 400+500 3 
; from two different colleges, 300 pie 2 ; 1 
1. Find out whether the proportion e=3 
the proportion of. UgHine: in two - 
and p. Then nee du. Log5 3) 


p and p, are significantly different. Obviously p > p. 


EXERCISES 

1.. In.a random sample of 500. persons from town A, 200 are found to be smokers. In a 
sample of 400 from town B, 200 are found to be smokers. Discuss the question whether 
the data reveal a significant difference between A and B so far as the smoking habit 
among persons is concerned. [Ans. Significant] 

2. Ina large city A, 20% of a random sample of 900 schoolboys had defective eye-sight. 
In another large city B, 15-5% of a random sample of 1600 schoolboys had the same 
defect. Is the difference between the two proportions significant? 

[Ans. Not Significant]: 

3. Inarandom sample of 500 men froma particular district, 300 are found to be smokers. 
In one of 1000 men from another district, 550 are smokers. Do the data indicate that 
the two districts are significantly different w.r.t the prevalence of smoking among 
men? [Ans. Not Significant] 

4. From each of two consignments of eggs, a sample of size 200 is drawn and the number 
of rotton eggs counted. Test whether the proportion of rotton eggs in the two 


2 (py — Pi) +2 (P2 ~ P23) 


+)? +n E(p, — Pi) (P2 ~ P23 


p2)=0 as pi, p2 are independent) 


Pq 
4 and hence var (p1) = n and var 


9) + n2p2) consignments are significantly different or not, given that 
Size of sample No. of rotton eggs 
‘ (py) + ng? var (P2)} Sample from consignment A 200 40 
Sample from consignment B 200 30 
[Ans. Not Significant] 


5. In two large populations there are 35% and 30% of fair haired people. Is the difference 


2 cov (p1, P) likely to be hidden in simple samples of 1500 and.1000 ? [Ans. e = 0-019, unlikely] 
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6. A machine produces 30 defective screws ina lot of 1000. After overhauling it produced 
20 defective in a lot of 800. Set-up a statistical hypothesis and test it. [Ans. u = 0-6] 
Ex. 15-19. Show that standard error of the number of success is the square root of the 
mean number of successes provided the mean proportion of successes is small. 
Sol. Mean proportion of successes = p 
.. It p is small, standard error of the number of successes 


vnpq = np(1-p) 
pad np 
= ,/Mean number of successes . 


Ex. 15-20. Show that precision of the proportion of successes varies as the square root 
of the number of members in the sample. 

Ex. 15-21. /f for one half of n events, the chance of success is p and the chance of 
failure is q, whilst for the other half the chance of success is q and the chance of failure is p. 
Show that the standard deviation of the number of successes is the same as if the chance of 


n 
success were p in all the cases i.e., npq but that the mean of the number of successes is > 


and not np. 

Sol. Let x; and x2 denote the number of successes in two halves. 

n n << 
Then E(x) = 5 P, var (41) = 3 Pa 
n n 
and EQ) = 5 % var (x2) = 5 Pd 
n n 
E(x, + x2) = 5 etD=5 
var (x; + x2) = var (x;) + var (x2) [. The halves are independent] 


i ee rele 
5 PI+ > PI = np4. 


Ex. 15-22. The sex ratio at birth is sometimes given by the ratio of male to female 
births, instead of the proportion of male to total births. Ifz is the ratio i.e., z= - Show that 


1 Zz 
the standard error of z is approximately Tas {E , n being large so that deviations are 


small compared with the mean. 
Sol. Let x be the number of male births. Then (1 — x) is the number of female births.’ 


Now z= PoP 
q 1-p 
pt ee eee oe 
oe, 14+z a 1+ 
-1 . 
Also pee -2.11-4} == (1425 i IES 
nH-xX n A n n n 
S.E. ofz = SE. ot(*) = Pq 
n 
up See 
l+z Vn 


Ex. 15-23. n individuals fall into one or the other two categories with probabilities p 


and q(= 1 —p), the number in two categories being n, and nz. Show that cov (n, n2)=—npq. - 
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. n n 
Hence obtain var (* ~ 2), 
n 


n 

Sol. Evidently ny 
ow E(n) 

E(n) 

cov(7), nz) 


15.3. Sample Mean 


A sample of size n can be « 
4) X, denote the random variab] 
variable defined by 


x 


The probability distribution 
or the sampling distribution of 1 
Remark. In the case of rar 


15.3.1. Central Limit Theorer 
Statement. Ifx,, x9, .....,x,t 


h 

mean ul and s.d. o and X = Z > 
n¢ 

i= 


has distribution that approaches 


Proof. M(t) of z = 
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[. The halves are independent] 
q 
given by the ratio of male to female 


: P 
:. [fz is the ratio 1.e., z= q , show that 


n being large so that deviations are 


4~ x) is the number of female births. 
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mo on 
Hence obtain var (* = 2) 
n 


n 
Sol. Evidently Ny = n-ny, 
Now E(n\) = np 
En) = n—E(m)=n-np = ng 


E{(y — np) (m2 — nq)} 
E{(m — np) (n— 1, —1g)} 
— E(ny — np) = — var (ny) =— npq 


cov(n lo n>) 


= yoo (n, — 2) 
[var (11) + var (717) — 2 cov (1, 72)] 


[npg + npq + 2npq] 


Ss x 
amma Oe fo 


15.3. Sample Mean 

A sample of size n can be described by the values of the random variables. Let x), x2, 
...; Xp denote the random variables for a sample of size n. Then the sample mean is a random 
variable defined by 

s _ +XQ +....+Xy 
7 n 

The probability distribution of ¥ is called sampling distribution for the sample mean ¥ . 
or the sampling distribution of mean. 

Remark. In the case of random sample, x1, x2 
15.3.1. Central Limit Theorem 

Statement. Ifx;, x3, .....,X, be n independent random variables all with same distribution, 


sab , X, are independent. 


ZzZ=— 


- 1¢< ? : : 
mean pt and s.d. o and x = — Y x;, then, if m.g.f- of x; exist, the variate 
n? 
i=] 


has distribution that approaches the standard normal. distribution as n —> °. 


{E=Y 
o 


Proof. M((t) ofz = E4e 


_ phn iw = 
=e Go Ese © 
at (x +29 +o tXp) 
= @ 9 E eovn 
uvn t t t 
a 2 — =X 
eo Ce ee eon 
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uvn t t 
~t x x —=x 
=e Oo E eovn E dein OMe ee Eleovn r 


(as x’s are independent) 
Now since all x; has same distribution, mean and s.d. their m.g.f. will also be same. 


Mijoip=e. @ {mM (- : } 
Ovn 


t 


Ovn 


where Mp ( is the m.g.f. of x;. 


log {M)(t) of z}= iy we + nlog {Mo (=| 


; 3 
= AEE noe i Ore + HO ( ‘ +... 
o 


ovn 2! 


Now pi’, (0) = p. 


2 
log {Mo (t) of z} = pee Ol + terms containing n in the 
denominator. = 


reg F r 
Lt_ log {Mo (t)ofz} = —> [n’2 (0)- {u's (0)}7] = — 
n-—>0o 20 ae 


[2 w'2(0)- W's ()}2 == 07] 
12 


Lt Mo(t) ofz = e? 


n->o 


which is the m.g.f. of a M(0, 1) 
As n > © the distribution of z tends to the standard normal distribution. 


15.3-2. The standard error of the mean of a random sample of size n froma population 
with variance o? 
Sol. Let x), x2, .... x, be a random sample. 


= Xp tX7 +. $x 
Then x =, Slee 
n 


is the sample mean. 


var (x) = + [var (x) +.x2 +... +x,)] 


= [var (x) + var(x2) + .... + var (x,)] 
ri 


(as x’s are independent) 
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S.E. of x 


q 


2 las Q 


15.4. Sampling of Variables 
In this case population is the freq 
provides a value of the variable. Dra 
the variable from those of the distrib 
15.4-1. Unbiased Estimate of Popt 
Let the values Xj, 5, ...., Xv con: 


Xx 
Then b= 
Let x1, x2, ....., X, be a random sa 
. 1 
lee ees 
n 
E(x) = Es 
n 
Now for fixed i, x; can take any « 
i 
ae 
1 
E(x;) = W 
BG) =~ + 
N 


Sample mean X is an unbi: 


15.4-2. Unbiased Estimate of Popul 
Sample s.d. is 


ee 

ny 

_1 

n, 

es, 

n 

a 

nj 

1, 

Es’) = — 

ni; 

Since E(x) = p= 
and E(X - p)* = var 


- E(s’) = o°- 


~ 
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<3 
x Xn SAO: ais i xe 
eovl 55 ee Es eox? 7 (. var (xj) = 0° for all ‘i’) 
: S.E. of x 2 
i + E.ofx = -=. 
(as x’s are independent) = 
their m.g.f. will also be same. 15.4. Sampling of Variables 


In this case population is the frequency distribution of the variable and its each member 
provides a value of the variable. Drawing of a sample is same as choosing certain values of 
the variable from those of the distribution. 


15.4-1. Unbiased Estimate of Population Mean 
Let the values X;, X9, ...., Xx constitute a finite population with mean p and variance 0’, 


Xt Xo +...4X 
Then Lh = eee 


N 
| Let x1, X2, ....., X, be a random sample from the population. The sample mean is 
2 Zar, lige 
+. #20)(—r.) ay el | * soe 
_ 1 
P ) | Be E(x) = — > E(x;) 
+ 


n: 
a2,t =1 : euch 
ai $y? Now for fixed i, x; can take any ° e of the values Xj, X9, ...., Xy each with probability 


a 
Ai N’ 
1 
E(xi) = W (X1+%2+.... Ay) =H 
ote : = 1 
*] + terms containing n in the ca BG) = 7 Due 
, i=] 
> Sample mean x is an unbiased estimate of the population mean. 
0)}"] = [ 15.4-2. Unbiased Estimate of Population Variance 


Sample s.d. is 


12 (0) — {u's (0)}? = pa = 07] 
> (4; -x)? 


22> Genres’ 
dard normal distribution. M jz] 


. : ° 1 n > o f 
imple of size n from a population _ = > {(x;- pw)? +(u-x)? +2(m-X) (x; -w)} 


> Gj -H)? -(u-%)? 


i 
n 

E(s’) = >) E(x;-p)? -E(@-p)’. 
pe 


-+ Xn) Since E(x) = p= E(%), E(x;—p)* =var(x))=0" 


ee 
E(% — yu)” = var (%) = — 
o 


Es?) = ef -— 
Hn 


2)+.... + var (%n)] 


(as x’s are independent) 
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Ex. 15-24. 4 sample of 400 
Can it be reasonably regarded as c 
and s.d. 1:3” ? 

Sol. Here n= 

: ie 
.. The sample may be regar 
g.d. 1:3”, 


ll 
ro 
= 
=] 1 
_ 
Soy 
Q 
i) 


¥ , n 
hk in E| —— s? = oc 
n-1 
Unbiased estimate of population variance 


ee fem oe oe ee 
Sa a (x; -X)" = S* (say) Ex. 15-25. 4 sample of 900 
reasonably regarded as a simple 
15.4-3. Test of Significance of Single Mean 2-61 cm.? 


Sol. Here n= 

sy ZS 

.. The sample can be regard 
2-61. 

Ex. 15.26. Mean of 10 readii 
of measurements is known to be ( 
length of the rod is 19-9” ? 

Sol. Here w= 

a z= 

Sample contradicts the gi 

Ex. 15-27. A sample of 900 ie 

regarded as a random sample from 


Consider a large random sample with mean x froma large population with mean yl and 


o 
s.d.o. Then ¥ ~ N | p, —= 
: ( é) 
AW cnet) 
Z o / Jn > 
The hypothesis to be tested here is the sample has been drawn from a population with 


mean Wl and s.d. o. 
The significance is tested with the aid of normal curve and the rules of taking decisions 


are same as before. 
15.4-4, Confidence Limits or Fiducial Limits 
Consider a large random sample of size n with mean x from a population (not necessarily 


normal) with mean wv and s.d. 6. Then Sol. Here n=. 
=: oe 2 
je ee At 1% level sample cannot be 

o/vn s.d. 2:3. 


is nearly a N(O, 1). Ifo be known but not yy, there is a range of possible values of for which 
x is not significant at any specified level of probability. If X is not significant at 5% level of 
probability, then since 


Ex. 15-28. The mean of a cer 
the mean of the samples of 100 froi 
the sample of 25 from the distribu 


P{i\z| > 1-96} = 0-05, . 
pL must be st Sol. Let 1 and o be the mean 
oh Let x be the mean of the sam 
< 1:96 
o/ Vn z= 
£4196 Spe Fe1-96— 


Tn Tn z z= 


The values ¥¥1-96—= are called 95% Fiducial Limits or Confidence Limits for the 


n ; o P(X < 0) = 
mean of the population corresponding to the given sample. The interval ae to 
n 
41-967 is called 95% Confidence Interval. = 
nh 
Similarly since P{\z| > 2-58} = 0-01, 99% Fiducial limits for the population mean are = 
55 2-59-2. Ex. 15-29. A normal populati 
Vn the mean of a simple sample of size 
In general if P{\z| > 2’} = P’, 100 (1 — P’)%. Fiducial limits are x + pasa Sol. Here w= 4 
n 
Evidently the limit vary from sample to sample. The totality of values of limits (for ee | 
given P’) for different samples determine the field within which yp is asserted to lie. This . 
field is known as Confidence Belt. < , aoe 
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(x, -)? =S? (say). 


ma large population with mean pt and 


as been drawn from a population with 


curve and the rules of taking decisions 


an x froma population (not necessarily 


range of possible values of for which 
lity. If is not significant at 5% level of 


1 


jal Limits or Confidence Limits for the 
7 o 
ven sample. The interval X16 t 


tucial limits for the population mean are: 


se Oo 
Fiducial limits are X + z'—=- 


n 

iple. The totality of values of limits 

Id within which w is asserted to lie. 
# ia: 
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Ex. 15-24. A sample of 400 male students is found to have a mean height of 67-47’. 
Can it be reasonably regarded as a sample from a large population with mean height 67-39” 
and s.d. 1:3”? 

Sol. Here n = 400, x = 67-47", = 67-39” and o = 1-3” 

: Zz 1-23 (< 1-96). 

The sample may be regarded as drawn from the population with mean 67-39” and 
s.d. 1-3”, 

Ex. 15-25. A sample of 900 members is found to have a mean of 3-4 cm. Can it be 
reasonably regarded as a simple sample from a large population with mean 3-25 em s.d. 
2-61 cm.? 

Sol. Here n = 900, X =3-4, w=3-25 ando=2-61. 

ie z= 1:7(< 1-96) 

.. The sample can be regarded as drawn from a population with mean 3-25 and s.d. 
2:61. 

Ex. 15.26. Mean of 10 readings on the length of a given rod is 20”. The s.d. of errors 
of measurements is known to be 0-1". Does the result contradict the assumption that the 
length of the rod is 19-9” ? 

Sol. Here 


Hl 


ll 


H = 19-9,0=0-1,n= 10, and x = 20, 
z = 3-162 (>3) 
Sample contradicts the given assumption. 

Ex. 15-27. A sample of 900 members is found to have mean 3-5 cms. Can it be reasonably 
regarded as arandom sample from a large population with mean 3-3 cms and s.d. 2:3 cms ? 

Sol. Here n = 900, xX =3-5,n=2-3 ando=2°3. 

a Zz = 2-6 (> 2-58), 

At 1% level sample cannot be regarded as drawn from a population with mean 3-3 and 
s.d. 2:3. 

Ex. 15-28. The mean of a certain normal population is equal to the standard error of 
the mean of the samples of 100 from that distribution. Find the probability that the mean of 
the sample of 25 from the distribution will be negative. 

o 


i08 
vn 10° 


Sol. Let p and o be the mean and s.d. of the population. Then p = 


Let x be the mean of the sample of size 25. 


_ *-H _ x-0/10 5x 1 Hoek 
7 61 N25 o/5 o > ~ NO, 1) 


x = es 
a5. 10 
eo ae ; 
x ace —z+—<0/=Pi z<-— 
me ($s 10 (- 5) 


= 0-5 —(0-1915) = 0-3085 (from tables). 


Ex. 15-29. A normal population has mean 0-1 and s.d. 2-1. Find the probability that 
the mean of a simple sample of size 900 will be negative. 


Sol. Here BH = 0-1, o=2-1, n= 900, 
= (= 30~N(0, 1 
z mi ~N(0, 1) 


0-07z + 0-1 


=] 
ul 
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3) 


P(z > 1-43) = 0-5 — P(0<z< 1-43) 
= 0-5 — 0-4236 = 0-0764. 

Ex. 15-30. A research worker wishes to estimate the mean of a population, using a 
sample sufficiently large, such that the probability will be 0-95 that sample mean will not 
differ from the true mean by more than 25% of the s.d. How large a sample should be 
taken ? 

Sol. Let n be the size of the sample. 

Now P{\x-pl < 0:25 o} =0-95 


Plz] < 0-25 Jn) =0-95 


P(x <0) 


| 


P(O<z < 0:25 Jn) =0-4750 
0-25 av = 1-96 
= 62. 

Ex. 15-31. Ifthe mean aka strength of copper wire is 574 lbs with a.s.d. of 8-3 lbs, 
how large a sample must be used i in order that there be chance Too that the mean breaking 
strength of the wire is less than 571 lbs ? ; 

Sol. Here pw = 574, 0 =8:3. 

xX—p mat Mn 3 Pt 
= < | ———_ |vn = - —— vn 
7 oldn ( 8-3 8-3 
Now aC <aun] = 0-01 
8-3 
P(ia < hd = 1-2 (0-01) =0-98 
3 
P| 0<z< —— = 0-49 
( "333 vi) 
vn = 2:327 (from normal tables) 
n= 8 3" (2:327) sat 46. 
n= 42. 


Ex. 15-32. The guaranteed average life of a certain type of electric light bulbs is 1,000 
hours with a s.d. of 125 hours. It is decided to sample the output so as to ensure that 90% of 
the bulbs do not fall short - the guaranteed average by more than 2:5%. What must be the 
sample size ? 


Sol. Here Hh = 1,000, o = 125 


2-5 fon we 
<p = 25ie, x >p-—25=975 
H- x > H 
_ xa —25 _ ava 
7 ide sida 8 


= 09. 


ocr] = 0-9-0:5=04 
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ee 


Ge 
5 
ea n= 4 
Ex. 15-33. /t is known that the; 
in the universe. It is, however consi 
such as to ensure that the mean of th 
the true value. How much would be 
for drawing 100 members of a samy 
Sol. Assume the simple sampli 


= 1 


Here p= 1 
0-01 
x-ul<—n = 0 
|x — pi i002 
lz] = 
0-01 =e 
10 
n= 9, 
2 


Sampling charges = Rs. 


Ex. 15-34. To know the mean w 
is taken. The mean weight of this sam 
any inference from it about the meat 

Sol. Here s.d. of the universe is 
the sample is iarge. 


9 

S.E. of the mean = 

¥225 
Assuming simple sampling conc 

probability be s.t. 

Vn |: 
Leé., Xm- 
ie., 67 - 
i.e, 65:2 


_ Ex. 15-35. The mean height of | 
2:24” Find the odds against the po. 
greater than 41-7” 

Sol. p =41:26, o =2-24, 


S.E. of the sample mean = a 


n 
The probability of z= a 
is needed. - 
Since z~M 


P(z> 1-96) = 
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0<z< 1-43) 

k. 

he mean of a population, using a 
he 0:95 that sample mean will not 
1. How large a sample should be 


wire is 574 Ibs with a.s.d. of 8-3 lbs, 


shance = that the mean breaking 


8-3 


tli =- av 


(from normal tables) 


41-46. 


in type of electric light bulbs is 1,000. 4 
he output so as to ensure that 90% of 


ry more than 2:5%. What must be the 


SAMPLING THEORY AND LARGE SAMPLE TESTS 585 
ee OP 


vn = 1:28 (from normal tables) 
Sc n= 40-9641. 
Ex. 15-33. /t is known that the mean and s.d. of a variable are-respectively 100 and 10 
in the universe. It is, however considered sufficient to draw a sample of sufficient size but 
such as to ensure that the mean of the sample would be, in all probabilities, within 0-01% of 
the true value. How much would be the cost (exclusive of overhead charges) if the charges 
for drawing 100 members of a sample be one rupee ? 
Sol. Assume the simple sampling conditions hold. 
Here nu = 100, o= 10 


_ |[x-pl< oe = 0-01 in all probabilities 


x- 0-01 
Iz} = 4 Mn <—— hn in all probabilities 
o 10 
0. 
eee 
10 
n = 9,000,000 
0 
Sampling charges = Rs. ithe = Rs. 90,000. 


Ex. 15-34. To know the mean weight of all 12-year old boys ina state, a sample of 225 
is taken. The mean weight of this sample is found to be 67 Ibs with as.d. 9 lbs. Can you draw 
any inference from it about the mean weight of the universe ? 

Sol. Here s.d. of the universe is not given but we can take in its place the sample s.d. as 
the sample is iarge. 


9 
S.E. of the mean = - = 0-6. 
V225 


Assuming simple sampling conditions, the mean weight 1 of the universe would in all 
probability be s.t. ‘ 


Vn |=) <3 
0 
j x- oe. <p<x+ = 
1.2., Jn > 7 
ie, 67- 18<w<67+1-8 
ie, 65:2 <p < 68:8. 


_ Ex. 15-35. The mean height of 10,000 children of age 6 years is 41-26” and the s.d. is 
2:24” Find the odds against the possibility that the mean of a random sample of 100 is 
greater than 41-7” 

Sol, = 41-26, o=2-24, n= 100 


o 
S.E. of the sample mean = a = 0-224. 


Th bability of _ tp 5 41:7 41-26 
e probability o Zi Si 0.224 


= 1-96. 
is needed. 
Since z~ NO, 1). 

P(z> 1-96) = 0-5-P(0<z< 1-96) 
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1 
= 0-5 —0-4750 = 0-025 = — 
ee 40 


Odds against are 39: 1. 

Ex. 15-36. Suppose that the distribution of the statures of men is a normal distribution 
with s.d. 2-48”. One hundred male students in a large university are measured and their 
average height is found to be 68-52”. Determine the 98% confidence limits for the mean 
height of the men of the university. 

Sol. Here x = 68-52, n=100 and o =2-48. 


o 
S.E. of sample mean = ae = 0-248”. 
n 


68-52-p 
0-248 . 
Now since P{|z| < 2:33} = 0-98, (from normal tables) is needed s.t. 
68-52-U 
| 0-248 
ie, 67:9 < w< 69-1. 
98% confidence limits for p are 67-9” and 69-1”. 
Ex. 15-37. The data concerning height measurement for a random sample of individuals 


from a given population are as follows : 
mean = 172, 8.D.=12,n=100 


< 2-33 


If a large number of samples of the same size were selected at random from the given 


population, what would be the limits of 2% confidence interval for the true mean ? 
Sol. The limits of 2% confidence interval for the true mean means the same thing as 
98% confidence limits for the true mean. 
Reqd. limits are 


172 + 2-33 Fea Le., 169-2 and 174-8. 


¥100 


Ex. 15-38. An unbiased coin is thrown n times. It is desired that the relative frequency 
of the appearance of heads should lie between 0-49 and 0-51. Find the smallest value of n 
that will ensure this result with 90% confidence. 


1 
Sol. Probability of head (tail) in a single toss = > 


S.E. of proportion of head = ,J-- = + — =e, 


Let p be the observed proportion of heads and 
_ p-90-5 
1/2vn 


Now since P{|z| < 1-645} = 0-9, ‘n’ is to determined s.t. 


aaa are 
1/2Vn 
1-645 1-64 
Le. 0:5 <p<05+ : 


Ova | 
0-5 — 


2Vn 


1-64 


2Vn 
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Es Sa ae 


and ie ee ee O-« 
2Vn 
. 1-645 
Subtracting = 0 
Vn 


i n= 671 

Ex. 15-39. If p is the observed 
trials, prove that the 95% fiducial limi 
are 


ptt 


Also show that 99% fiducial limi 
(p ~ p*) (2:58) = n(p 


Sol. S.E. of proportion of succe: 


Now pee 
L 


Now since P{|z| < 1-96} = 0-95, 9 
PoP’ 
Pq 


n 


p- 196 74 <p’ <p: 


Similarly, since P{\z| < 2:58} = 0+ 


/, 


P= pe. 
Le., n(p’ — py? = (2:5 
= (25, 


—y 


- Ifp;, p2 are observed proportion ¢ 
sizes n; and 1, show that 99% 
proportions of successes in the px 


Pi- pz = 2:58 


Also find 95% fiducial limits. 

2. A sample of 900 members is fou 
regarded as a simple sample fron 
cm? 

3. A simple sample of 1000 memt 
reasonably regarded as a simple : 
and s.d. 2-6 cm? 

4. The standard deviation of a popul: 
size 100(i) the sample mean will d 
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1 


40 


s of men is a normal distribution 
iversity are measured and their 
confidence limits for the mean 


u is needed s.t. 


aw” 


a random sample of individuals 


) 

slected at random from the given 
terval for the true mean ? 

ie mean means the same thing as 


lesired that the relative frequency 
0:51. Find the smallest value of n 


Ls.t. 
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and 
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ce 0-51 
2Vn 
]- 
Subtracting a = 0-02 
n 
n = 6765. 


Ex. 15-39. [fp is the observed proportion of success in n independent Bernoullian 


trials, prove that the 95% fiducial limits for the population proportion p’, for large samples, 


are 


p+ 1-96 io. 
n 


Also show that 99% fiducial limits are the roots of quadratic equation 
(p—p’) (2:58) = np’ ~ py 


S.E. of proportion of successes = at 


Sol. 


, 


Pp 


== ~ N(0, 1) 
[rs 
n 


Now since P{|z| < 1-96} = 0-95, 95% fiducial limits are given by 
p-p' 


[24 
n 
p-1-96 ‘ia <p’ <pt1-96 ine 


Similarly, since P{|z| < 2-58} = 0-99, 99% fiducial limits are given by 


p = p+ 25874 


n(p’ ~ py = (2:58)? pq = (2-58) p(1 —p) 
(2:58)° (p—p”). 


Now z= 


< 1-96 


u 


EXERCISES 


- Ifp;, pz are observed proportion of successes in two independent sets of trials of large 


sizes 1, and n2, show that 99% fiducial limits for the difference (p'1 — p’2) of the 
proportions of successes in the population are 


Pigi , P2492 
Spe D: Se ee 
Pi ~pr * 258 i Re ay 


Also find 95% fiducial limits. 


. A sample of 900 members is found to have a mean of 3-4 cm. Can it be reasonably 


regarded as a simple sample from a large population with mean 3-2 cm and s.d. 2:3 
cm ? [Ans. z = 2-6, No.] 


- A simple sample of 1000 members is found to have a mean 3-5 cm. Could it be 


reasonably regarded as a simple sample from a large population with mean 3-2 cm 
and s.d. 2-6 cm? [Ans. z = 3-6, No.] 


. The standard deviation of a population is 2-7”. Find the probability that in samples of 


size 100(7) the sample mean will differ from the population mean by 0-75 or more and 
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(ii) the sample mean will exceed the population mean by 0-75” or more. 
. [Ans. 0-0054; 0-0027] 
5. A sample of 900 members is found to have a mean of 3-47 cm. Can it be reasonably 
regarded as a simple sample from a population with mean 3-23 cm and s.d. 2:31 cm? 
[Ans. No.] 


where s? =— ls (x1; -%,)' 
M j=] 


are sample variances. 


15.4-5. Test of significance of the difference between the means of two large samples Sos 

- Let x), x2 be the means of two independent samples of sizes n, and n2 (both n and ny Note. 3. If o,2 # oy? ando 
are large) from two different populations with means [1; and [, and s.d. 0; and 02 respectively, respective samples. Respective est 
Then Ex. 15-40. The means of sim 


respectively. Can the samples be re 


~Nip and x, ~ Niu o2 
lees 2 a Ts cs ny, = 1,000, nz = 2,000, 


~ 


Zo 


oe st 2 ’. Hypothesis is wrong and |: 
z= X.~N]up-bo, —_ same population of s.d. 2-5”. 

Ex. 15-41. A simple sample o 

“42d - he 2) ~ MO, 1) s.d. of 2:56”, while a simple sampl 


a s.d. of 2:52”. Do the data ind 
hs ‘7 


Englishmen ? 
The hypothesis to be tested is ‘Are population means same i.e., [lj = [2’. Assuming this Sol. Here population standar 
hypothesis the test statistic becomes 


from samples. As samples are large 
xX) -Xx 
ae eras 
0 0 
Aol We? 5 
yy Mg 


population standard deviations. 
The significance is tested with aid of normal curve. 


oO, = 

oS 
Difference between san 
Englishmen are on the < 
Ex. 15-42. A random sample 


Note 1. If Gi: = oy = 0", then 40 p.m. with as.d. of Rs. 24 p.m. an 
X) -X their mean pay as Rs. 36 p.m. with 
z= 1 Ti pay of men from the two states diff 
Say ee Sol. Here ny = 
ue IG is 
Note 2. Ifo is not known, then it is to be estimated from the samples. An unbiased ; 
estimate of o” based upon two samples is i 


(m -1) S? + (ng -1)S3 


Ss = 
ny +n2 -2 

Difference between me: 

where Se = ; xX] )? and sf = ;-X2 is states differ. 
. 7 Ex. 15-43. Mean and standara 
S" is unbiased because of two groups taken from two univ. 
2) 2 1 2 2 M 
E(S*) = " : 5 {(m — 1) E(S;*) + (2 —-1) E(S2°)} University A 

University B 
= een {(m - 1) Oo +(m—- 1) o}= o Test the significance of the dif 
1 oo 
[ HS1°) = (Sy )= o] Sol. Here z= 


Since 1, and nz are large, ny — 1 ~ ny-and n,-1~ 


S;~s,;? and Sy ~ 0" 


Difference between the 
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:an by 0-75” or more. 
[Ans. 0-0054; 0:0027] 
1 of 3-47 cm. Can it be reasonably 
4 mean 3:23 cmand s.d. 2-31 cm ? 
[Ans. No.] 


the means of two large samples 
s of sizes m and nz (both n and nz 
id pz and s.d. O and o; respectively. 


) ~M0,1) 


ns same i.e., [41 = H2’. Assuming this 


aated from the samples. An unbiased 


St 


1 2 ahd 
a > (x2; -¥2) 


j=l 
-1) E(S2) + (tm — 1) E(S2'Y} 


Ne+(m-lo}=o 3 
Co ES2)=A(S2)= 01, 


59° 
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1 ny _ 1 ny _ 
where s? =—y Ce and s3 oe > (x2) -¥2) 
1 j=1 2 j=) 


are sample. variances. 


ge = mst 289 
ia ny +n ; 

Note. 3. If o, x oy and 6, and 62 are unknown, these are to be estimated from 
respective samples. Respective estimates are Sy ~s,? and Sy ~s,’. 

Ex. 15-40. The means of simple samples of sizes 1,000 and 2,000 are 67-5” and 68-0” 
respectively. Can the samples be regarded as drawn from the same population of. s.d. 2-5”? 

Sol. nn; = 1,000, n2 =2,000, x, = 67-5, x7 = 68-0 and o=2:5. 

we z= 52(>3) 

”. Hypothesis is wrong and hence the samples cannot be regarded as drawn from the 
same population of s.d. 2:5”. 

Ex. 15-41. A simple sample of heights of 6,400 Englishmen has a mean of 67-85” and 
s.d. of 2:56”, while a simple sample of heights of 1,600 Australians has a mean 68-55” and 
a s.d. of 2-52”. Do the data indicate that Australians are on the average taller than 
Englishmen ? 

Sol. Here population standard deviations are not known so these are to be estimated 
from samples. As samples are large, sample standard deviations can be taken as estimates of 
population standard deviations. 

; O; = 2:56, G2 = 2°52, 
z= 10(>3). 
Difference between sample means is significant. 
Englishmen are on the average smaller than Australians. 

Ex. 15-42. A random sample of 1200 men from one state gives their mean pay as Rs. 
40 p.m. with as.d. of Rs. 24 p.m. and a random sample of 1600 men from another state gives 
their mean pay as Rs. 36 p.m. with a s.d. of Rs. 32 p.m. Discuss whether the mean levels of 
pay of men from the two states differ. 


Sol. Here ny = 1200, x, =40, 0) = 24 and 
ng = 1600, X7 = 36, 02 = 32 
4 
2 “= 3-78 (> 3) 
(24), G2)" 
1200 1600 


Difference between means is significant and hence mean levels of pay in two 
states differ. 
Ex. 15-43. Mean and standard deviations calculated from the weights in kgm of students 
of two groups taken from two universities are given below: 


Mean S.D. Sample size 
University A 55 10 400 
University B 57 15 100 
Test the significance of the difference between the means. 
57-55 


Sol. Here ~ 1-2648 (< 1-96) © 


“fan? | 05? 
400 100 


Difference between the means is due to fluctuation of sampling only. 
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Ex. 15-44. A random sample of 1000 farms in a certain year gives an average yield of 
rice 2000 /bs per acre with a s.d. of 192 lbs. A random sample of 1000 farms in the following 
year gives an average yield of rice 2100 lbs per acre with a s.d. of 224 lbs. Show that the 
data are inconsistent with the hypothesis that the average yield in the country as a whole 
was the same in the two years. 

2100-2000 
Here 2, = —_—— _ = 10:7 (3) 
(192)? (224)° 
1000 1000 
Data is inconsistent with the hypothesis. 

Ex. 15-45. 4 potential buyer of light bulbs bought.50 bulbs each of two brands. Upon 
testing these bulbs, he found that brand A had a mean life of 1282 hours with a s.d. of 80 
hours whereas B had a mean life of 1208 hours with a s.d. of 94 hours. Can the buyer be 
quite certain that the two brands do differ in quality ? 


Sol. Here ny = 50, x; = 1282, o) = 80 
and ny = 50, X2 = 1208, 02 =94. 
z= 42(>3) 


Difference is significant. 

Ex. 15-46. A random sample of 200 villages was taken from a certain district and the 
average population per village was found to be 485 with a s.d. of 50. Another random 
sample of 200 villages from the same district gave an average population of 510 per village 
with a s.d. of 40. [s the difference between the averages of the two samples significant ? Give 
reasons. 


Sol. Here ny = 200, x; =485, o) =50 
and ny = 200, x, =510, 6. =40 
510-485 ; 
lz, = — = 55 (> 3) 
(50)? + (40)? 
200 © 


Difference is significant. 
Ex. 15-47. /f60 new entrants in a given university are found to have a mean height of 
68-60” and 50 seniors a mean height of 69-51”, is the evidence conclusive that the mean 
height of the seniors is greater than that of the new entrants ? Assume the s.d. of the height 


to be 2-48”. : ee 
Sol. Here ny = 60, x, = 68-6, 6, = 2-48 
and ng = 50, x, = 69-51, Oo; = 2-48 
0-91 


z| ~ heer se as 1:92 (< 1-96) 
2-48 ,/— +— 
V60 50 


Difference is insignificant and hence it cannot be said that the mean height of the 
seniors is greater than that of the new entrants. 

Ex. 15-48. A sample of 100 electric bulbs produced at a factory A showed a mean 
lifetime of 1190 hours and a standard deviation of 90 hours. A sample of 75 bulbs produced 
at factory B showed a mean lifetime of 1230 hours with a standard deviation of 12 hours. Is 
there a significant difference between the mean lifetimes of the two brands of bulbs at 5% 
level of significance ? 

Sol. Here ny = 100, x; = 1190, o, =90 
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m = 7 


Difference is significant 
Ex. 15-49. A certain psychok 
prisoners : (a) first offenders, (b) re 


Population San 

Si. 
(i) First offenders Si 
(ii) Recidivists 7 


Find the 95% confidence limit: 


( 
Sol. zZ=- 


95% confidence limits a1 


6. 
i.e, 6-43 — (0-48) (1-96) < p 
i.e, 5-4892 <p) — by < 7370 


Ex. 15-50. Zwo populations he 
other. Show that in samples of 45: 
difference of means will in all probe 
find the probability that the differer 

Sol. Let 1 be the common mear 
Let x, and x» be the sample mean: 


- Then Zz 
Now \2| 
[i -¥]< 
Lé., Xj ~X re 

E42 10 


Now P{|x,—xX,| >0-050} 
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na = 75, Xx = 1230, 02>= 12 


1230-1190 

zg] = = 439 (> 1-96) 
(90°, (12)? 
10075 


Difference is significant at 5% level. 


Ex. 15-49. A certain psychological test was given to two groups (samples) of army 
prisoners : (a) first offenders, (b) recidivists. The sample statistics were as follows : 


Population Sample Sample Sample 
size mean s.d. 

(i) First offenders 580 34-45 8-83 

(ii) Recidivists 786 28-02 8-81 


Find the 95% confidence limits of the difference of the means for the two populations. 
(34-45—28-02)—(py -L2) 
a 


Sol. 
(8-83)? : (8-81)? 
580 786 
_ 6-43-(by —H2) 
- 0-48 = AGT) 
95% confidence limits are given by 
6-43-(Hi-H2)} 1.96 
0-48 
ie, 6-43 — (0-48) (1-96) < pt — M2 < (0-48) (1-96) + 6-43 
iLe., 5-4892 <j — Ho < 7-3708. 


Ex. 15-50. Two populations have the same mean, but the s.d. of one is twice that of the 
other. Show that in samples of 4500 each drawn under simple sampling conditions the 
difference of means will in all probability not exceed (0-1) 6 where o is the smaller s.d. and 
find the probability that the difference exceeds half that amount. 

Sol. Let 1 be the common mean and 6, 20 be the standard deviations of two populations. 
Let x, and x, be the sample means. 


Then z= oo ae?) 
_o , Qo)’ 
4500 4500 
5 Aree 
2 
30 
Now lz] < 3 
| 3 -m\<& = 
ie., |x, -~X2]< ie 0-1o. 
Now P{|x,-X,| >0-050} = P{lz|>1-5} 
= ]—Pf{\z|<1-5} 


= 1-2P{0<z<1-5} 
= 1-2 (0-4332) =0-1336. 
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Ex. 15-51. Jn an intelligence test administrated to 60 boys and 100 girls, the following Sample siz 
results were obtained : Ist Sample 300 
Mean score S.D. 2nd Sample 200 

Boys 114 13 ; Is the difference between th 

Girls 110 11 7. Two populations have the s: 

Assuming the correlation coefficient between the two to be 0-75, test whether the of the other. Show that in sar 

difference between the means is significant. the difference of the means 

Sol. S.E. of the difference between the means smaller s.d. and assuming th 

5 5 find the probability that it e: 

_ fSi, 92_ 575192 8. Two population have their 1 

mM nny that in the samples of size 20 

difference of means will, ina 

(13) (l 1)? ; (0-75)(13)(11) What is the probability that th 


60 100 100-60 15.4.6. Test of significance of dit 
samples 
Soft 2577 & P12 : 
Consider two large independ 
from two populations with standa1 
The hypothesis to be tested is 
Assuming this hypothesis, the stat 


4 e 
kl = y= 36C3) 


Difference is significant. 
a EXERCISES 
1. The data given below gives the mean and s.d. of stature of two groups of boys taken 


zs 
from a certain city : 
Sample size Sample mean Sample s.d. for large samples. Now for large s 
1145 48-6 2-416 var (s}) = 
654 50-79 2:53 
Find whether the difference between the means significant. [Ans. Significant] var (Sj — 5) = 
2. 64 senior boys from college A and 81 senior boys from college B had mean heights of 
68-2” and 67-3”,.respectively. If the s.d. for heights of all senior boys is 2-43”, is the 
difference between the two groups significant ? S.E. (5) —52) = 


[Ans. z = 2:21, significant at 5% level] 

3. A random sample of 1000 men from northern region gives their mean wage to be 

Rs. 21-50 per day with as.d. of Rs. 1-5. A sample of 1500 men from southern region 

gives a mean wage of Rs. 21-70 per day with as.d. of Rs. 2. Discuss whether the mean 
rate of wages varies as between the two regions. [Ans. z= 2-85] ° 

4. Two random samples of sizes 1000 and 1500 give following values of mean and s.d. : 


For large samples, z 


The significance is tested with 
Ex. 15-52. Random samples « 


Sample size Sample mean Sample s.d. data relating to the heights of mak 
1000 47 28 out 

m 

aad a? 40 University A 67 

Test whether the difference between means is significant. {Ans. No] University B 67 


5. Intelligence test on two groups of boys and girls, give the following results. Examine 
if the difference between means is significant. 


(2) Is the difference between t 
(ii) Is the difference between tl 


Sample mean Sample s.d. Sample size Sol. (i) Here ny = 
Girls 84 10 121 and nm = | 
Boys 81 12 ; 81 
[Ans. Not significant] sais 


6. Two samples of bricks, produced at two different works, were tested for transverse 


ith th i : : ee 
strength oe the following results ‘Theleiend-<ignineantd 
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2:416 
” 2:53 
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-s of all senior boys is 2:43”, is the 


5. z = 2:21, significant at 5% level] 
‘gion gives their mean wage to be 
of 1500 men from southern region 
of Rs. 2. Discuss whether the mean 
[Ans. z = 2-85] 
following values of mean and s.d. : 
Sample s.d. 
28 
40 
mificant. [Ans. No] 
give the following results. Examine 


d. Sample size 
121 
81 
[Ans. Not significant] 
nt works, were tested for transverse 


SAMPLING THEORY AND LARGE SAMPLE TESTS 593 
Sample size Sample mean Sample s.d. 
1st Sample 300 990 240 
2nd Sample 200 1000 202 
Is the difference between the means significant ? [Ans. Not significant] 


7. Two populations have the same mean, but the standard deviation of one is twice that 
of the other. Show that in samples of 500 each drawn under simple random conditions, 
the difference of the means will in all probability not exceed 0-3 o where o is the 
smaller s.d. and assuming the distribution of the difference of the means to be normal, 
find the probability that it exceeds half that amount. [Ans. 0-1336] 

8. Two population have their means equal but s.d. of one is twice that of the other. Show 
that in the samples of size 2000 from each drawn under simple sampling conditions, the 
difference of means will, in all probability, not exceed 0-15 o where o is the smaller s.d. 
What is the probability that the difference will exceed half this amount? [Ans. 0 -1336] 


15.4.6. Test of significance of difference between the standard deviations of two large 
samples 
Consider two large independent samples of sizes n,, n2 and standard deviations s}, 5, 
from two populations with standard deviations 0), 62 respectively. 
The hypothesis to be tested is ‘Are population standard deviations same ‘i.e., 6; = 69.’ 


_ Assuming this hypothesis, the statistic 


51 ~52 


= 12 -N00,1 
a S.E.(s] —s2) ( , ) 
for large samples. Now for large samples drawn from normal populations, 
eB yee 
var (s}) = Om and var (s2) = On, 
a i. Bicramecrs 
var (s; — 52) =. var (s;) + var (s2) = 2n + 2ny 
ne cf Sta Sts 
i 2, 2n 
S1—S2 
For large samples, z = 5 ~N(0, 1) 
o] o2 
—— + ~~ S 
2ny 2n 


The significance is tested with the aid of normal curve. 
Ex. 15-52. Random samples drawn from two universities A and B gave the following 


data relating to the heights of male students : 


Sample Sample Sample 
mean s.d. size 
University A 67-42 2-58 1000 
University B 67-25 2:50 1200 
(i) Is the difference between the means significant? 
(ii) Is the difference between the standard deviations significant ? 
Sol. (2) Here ny = 1000, x; = 67-42, 5, =2-58 
and ny = 1200, x2 = 67-25, 5; =2-50 
pero se (< 1-96) 
(2-58) 2 5)" 
1000 1200 


There is no significant difference between sample means. 
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0-08 


(2:58)? | (2-5)? 
2000 2400 
Sample standard deviations are not significantly different. 
Ex. 15-53. The mean yield of two sets of plots and their variability are as given below. 
Examine (i) whether the difference in the mean yields of the two sets of plots is significant 
and (ii) whether the difference in the variability in yields is significant. 


~ 1-04 (< 1-96) 


(ii) z= 


Set of 40 plots Set of 60 plots 
Mean yield per plot 1258 1243 
S.D. per plot 34 28 
15 
Sol. (2) 2) = eee = 2-3 (< 2°58) 
(34) | (28) 
40 60 
Difference between the mean yields is insignificant at 1% level. 
6 
(ii) jz| = = 13(< 1-96) 
(34)? (28)° 


2(40)  2(60) 


Difference is not significant. 


EXERCISES 
1. Test whether the difference between the standard deviations is significant, given that 
Size . s.d. 
Sample A 1,392 53-84 
Sample B 630 56:56 [Ans. Not significant] 
2. Two samples of sizes 1000 and 800 gave the following results : 
Medians S.D. 
Ist sample 17-5 2:5 
2nd sample 18 2:7 


Assuming that samples are independent, test whether the two samples may be regarded 
as drawn from the universes with same standard deviations. [Ans. Yes at 1% level] 
Ex. 15-54. Two samples of sizes 100 and 80 gave the following results 


Medians S.D. 
Ist sample 85 7 
2nd sample ; 100 8 


Test whether the difference between the medians is significant. 

Sol. Let 6), 62 be the standard deviations of two samples of sizes n, and nz. Then 
assuming the samples to be independent. 

S.E. (e) of the difference between the medians 


of , 93 
(1:25331) yJ—- + = 
ny ny 


Here oO; = 7, 02=8,n, =100 and n. =80 
, e= 1-42 
_ Difference between medians = shee. ous 
Hl = e aa 


’ Difference is.highly significant. OU 
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16.1. y? distribution 


Let x,,X3.....,x, bem independent standard normal variates. 
Then, each one of 


14 2 eee ds 
an +X7°4+......4+%X, ) isa (2) variate. 


2 2 2 2 y? ; 
TR yo ax + xg tet, “7” 8a Y( 5 | variate. 


-. Distribution of © 


is 
a i 2 
] a ae) 2\2 
woe [S} 45] 
nl 
T = 
(3) 
© 
i, dP = ————~e 7 (w*)? dy’) 
2nr( 2 
2 


where 0< y? <a, 


This distribution is known as chi-square distribution and yw is called chi-square variate. 


n is called the degrees of freedom associated with chi-square distribution. 
Remark. (1) Normal distribution can be regarded as a particular case of chi-square 
distribution for n=1. 


Hereafter chi-square distribution will be written as yw? -distribution. 
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(2) x1, x2,.:.-,X, can be represented by a sample point with co-ordinates (x; jp Ape GK ) 
in Euclidean hyperspace of n dimensions. If these variates are subjected to a linear constraint, 
that constraint can be considered to represent a hyperplane. Thus, the effect of this constraint 
is to lower the dimension by one and hence the number of degrees of freedom associated 


with y* will be n—1. 
In general, if there are p independent linear constraints, the number of af. is n- p. 
16.1.1. M.G.F, of w’-distribution 


M(t) oe Ele” } 
> wey 
> 2 


fy 
(y?)2 d(y?) 


2 


—da-2ny? 


- ma)t ey?) dy?) 
0 


I 
Put hs ~2t)y? = 


{ 
<< 


dw?) 


es nl2 n 

(1 ~ 27) r(3 

ib (3) 

— (- 21"? r(3) 2 
2 


ah 
= (1-21) 2 
which exists only when, |2¢|< 1. 


16.1.2. Moments and 8, y co-efficients 


My(t) = (1-22) 2 


ies! 
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wN|s 
Nee” 
wD [7 ‘ 
=]|N|z 
+ 
| oneal 
ee 
ease 
NO 
™” 
— 
nN 
+ 


ws 
t 
ys 


CHI-SQUARE DISTRIBUTION 


meal 


Ht’ (0) 


Ho 


which gives variance. 


43(0) 


Hs 


14 (0) 

Ha = 24(0)-4 
= An +2) 

= n*412n3 


12n? +48 


tl 
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11; (0) = co-efficient of 7 
r! 


1 
~ 
=e 
J 
+ 
ie) 
= 
~ 
= 
+ 
ie) 
S 

| 
pa 
— 


or 


mean = Ht, (0)=n 
wt, (0) = n(n+2)t 


Hy = n(n+2)—n? 
2n 


which gives variance. 
3(0) = n(n+2)(n+4) 


3 = 15(0)—3y5 (0); (0) + 2{uj(0)}? 
= n(nt+2)(n+4)—-3n?(n+2)+2n? 
= n(n? +6n +8) — 3(n? + 2n*)+ 2n? 
= 8n ‘ 

14(0) = n(n+2)(n+4)(71+6) 


Hg = 14(0)— 415 (Oj (0) + 615 (0) {4 (0)}* ~ 3{n{(0)}* 
n(n+2)(n+4)(n + 6)—4n?(n+2)(n+4) + 6n3(n+2)—3n4 


= n'+12n? +44n? +48n ~4n?(n? +6n+8)+6n?(n+2)- 3n* 


= 12n? +48n 
B,y Co-efficients 
2 
8 12 
B, = an oo B. = = =3e 
My 7 He - 
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16.1.3. Cumulative Function and Cumulants 


Ky(t) = log My(t) 


n 
= jog (1—2r) 2 


n 
-— log (1-2 
5 g ( ) 


2 r 
2 far OO 4, beabed | 
2 2 


k, (0) = co-eff. of t=n 


k =2'"(r-Din, r22. log {M) (or) = 


t" 
, = co-efficient of a 
r} 


16.1.4. Mode . 
The density function is 
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1 3 n 2 
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(y ) | 2 2. | 1p 
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fiw?) = 0> y? =n-2, 0 “. Zand hence y? tends to normal - 
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©. for y? =n—2, f(y?) is maximum. 

Theorem. The sum of any finite nun 
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“. Zand hence y? tends to normal Variate as_n —> @, 
16.1.6. Additive Property of yw? -variates 


. : Do ce oat mdi an eNO yy Ft, 
Theorem. The sum of any finite number of independent \~ -variates is a \W" -variate. 


Proof. Let Vv, We ewe? be n independent wy? -variates with M,Nz....n, degrees 
of freedom respectively, 


Then OO) ga SI? pes 
Let y? = Wr tyy74 asttes +y,” 
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= Ele" }Ele""? }....E{e™" } 


Ry 


= (1-28) 21-28) 2... = 2n 2 


Ny ty tut Ny ) 


= (1 sna ss 


which is the m.g.f. of a w? -variate with (1, +...... tn,) df. 

wy? isa y? -variate with (n,+......- +n,,) af. 

Ex. 16-1. /f we and yo are two independent w? -variates with ny andn, df 
respectively, then wel yo is a p,(. 2] variate. 


2 2 
Sol. Distributions of Wi @"d V1" respectively, are 
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. Marginal distribution of x is 
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Wy 
2 
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are independent. Hence find their distributions. 


2 oe. 
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= (1-24)M~, 
Differentiating r times w.r.t. 
l ee ae. 1-2 M3 (1) 2M? 
ny+n e? y 2 dy x (1-2t) w(t) ~ POE NES 
a r(Z% | 
2 
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riates with n, and n, df. respecti- 


2 
V2 


1 
)2 dy, dy.” 


O< wy. Wo <@ 


2 
dy=yyP tw, 


y =y(l-x) 
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and “att. 7. Ne = y: 


22 pl mtr 
2 


and y isa wy? -variate with (m, +n.) df. 
Ex. 16-3. For a wy? -variate with n df. show that 
Bre = 2r(h1, +My) r21 


Sol. For a y? -variate with n.df, 


mean =n 
.. M.G.F. about mean is given by 


M-. (t) = Ber”) 


e™ My (t) 


eva a yy Was 


log {Ma2 wh er = log (1-2) 


Differentiating w.r.t. ‘7’ 


ne 
Ma() 5 2 1-2r 
= 2nt 
1-21 
> (1-2t)M7, (t) = 2ntM (t) 


Differentiating r times w.rt. ‘t’ by Leibnitz’s theorem 


(1-22) M"*1y2 (t)— 2rM (1) = Wnt" PO +rM BP (0} 
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(1) 
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Now H, = {M-. (t)},<0 
.. Substituting ¢ = 0 in (1) 
. Myst 27H, = 2nrp,; 
> Meat =2r {H, +ap,_}. 
16.1.7. Chief Features of the chi-square Probability curve 


The eq. of the y? probability curve with n d.f. is 


1 
DA RY ON 
qn2p n 
2 


logy = shat +(3-1} log y? — log 2”? - log (3) 


2 


he. By 
e? (y?)? 


Since y? >0, y>0, we have 


dy 
= — <0 
for . n= 1,2, dy? 
and for n> 2, 
dy 
dy? ze if 0<y? <n-2 
= if y? =n-2 
<0 if yw? >n-2 


”. For n= 1, 2, y decreases continuously as yw? increases and for n > 2, y increases or 


decreases as wy” increases according as w?<n-2 or w*>n-2 and for 
d 

yw? =n-2,—- =0 
dy 


2 which implies that y is maximum. 


.. For all values of n, y > 0 as y —> 0, 


: Ww -axis is an asymptote to the curve. 
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The shape of curve for n = 1, 


Ex. 16-4. If y* is a chi-squ 


normally distributed about mean 


Sol. Now \2w 


2y 


ws 


Now Von 
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The shape of curve for n = 1, 2, 3... is shown below : 


urve 


By 


(y?)? 


log w? — log 2” — log r(2) 


{ 
Ex. 16-4. /f y? is a chi-square variate with n df, show that if n is large J2y? is 


normally distributed about mean Jf} —| with variance unity. 
Sol. Now 2y? < V2n-1 +Z 5.0 
if Qw? < (Qn-l+2°+2V2n-l1z 


1 1 
i.e, : yw < n- 5450 +V2n— lz 


beet | 


lA 


A 


we-n 1 ee ye eee 
a V2n = 2W2n 2V2n \ On 


iLeé., < z, as nis large 
O<y? <n-2 V2n 


y? =n-2 


p {Y2u? <V2n=1 +z} 
= p { 20? -Van-1 <2} 


N ¥ =" _ NO,1 
OW V2n = : ) 


(2? — J2n-1 ~ N(O,1) 
> {2w? aN (A 2n= 1-1): 


~ 
a 
€ 
3] w 
= | 
IA 
nN 
a) 
2 


9 
wo>n-2 
icreases and for n > 2, y increases OF 


‘<n-2 or w?>n-2 and for ; 
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EXERCISES : Also 0, +0,+...40, =n 


1. If wy? is a chi-square variate with n d.f., show that if n is large aw? is normally If x is sufficiently large so 


seat . . . factorials can be used. 
distributed about mean ,/>,, with variance unity. 


2. Show that if x has the standard normal distribution, then x? has the chi-square 
distribution with one degree of freedom. ; oe PR> 


3. Show that, for 2 degrees of freedom, the probability P of a value of y? greater than I] 


i=l 


1 3 
Vo- is exp. & ve" | and hence show that 


2 1 , 
Wo =2 log, (=) | (2n 


4. Prove that for a random sample of size 10 from a normal population with variance 1, 


10 
PSs 3 225} <0-005 | im 


i=] | 


(Given that P(y?(9) < 23-59) = 0-995). where c= — 
2n 
16.2. y’-tests \ 
Tests of significance based on yy? -distribution are called y? -tests. log P = log 
~ o 
Cells. When a given data is arranged in compartments, the compartments are called 
cells and the corresponding frequency is called Cell Frequency. _ 
Linear Constraints. Constraints which involve linear equations in the cell frequencies Now = Ae 
(i.e., equations containing no squares or higher powers of the frequencies) are called linear = np; 
constraints. , | . 
Degrees of Freedom. /t is the greatest number of cell frequencies which can be assigned Tet Bei (0; 
arbitrarily. It is given by t Fi 


! 
' | 
ven-k | 
where n is the total number of cells and k the number of independent constraints. | 


Definition of y?. If O; and e, be the observed and expected frequencies, the variate 


yw? is defined hy 


(0; -¢)" 
y? = oy i i 
i & . k 
7 s A A ‘ . ae l Tee 
This variate follows yw -distribution as seen below : °8 C 2, 
i= 
Let there be a random sample of size n whose members are distributed at random in k 
cells. 
Let p, = prob. that a member is in ith cell. Then, the prob. that 0, members are in Ist | k 
cell, 0, members in 2nd cell etc., is given by - p 
n! 0, 03 Oo 
Py Pz Ov Px 
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tifn is large {2y? is normally 


tion, then x? has the chi-square 


ry P of a value of y? greater than 


1ormal population with variance 1, 


005 


called vw -tests. 

nents, the compartments are called 
“equency. 

2ar equations in the cell frequencies 
of the frequencies) are called linear 


Il frequencies which can be assigned 


of independent constraints. 


ad expected frequencies, the variate- 


Wi 


mbers are distributed at random in k 


the prob. that 0, members are in Ist 
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Also 0, +0,+...+0, =ENn 


If n is sufficiently large so that 0, ,03...... 
factorials can be used. 


607 


(1) 


0, are not small, Stirling’s approximation for 


pes Pi P2? Pat 
IT} 2ne ofa} 
i=l : 
a 1 1 1 np, mae) NP, me 
ae ilo, J “Lo 
(2m)? n? (pypy.-Px)? : 
I 
7 n(%) 
= 0; 
where c= : 
k-l kel a 
(2m)? n? (pypg.---Px)? 
log P = weer (0, + +) 8 Be 
Now e, = expected frequency of ith cell 


Let é, = 
ae 


; 
log P x loge+ Ye +e; 3} bel 
i ej + bi Ve 


~ Ye +iae+ 


i=] 


ie : Nl 1 
ni Sheed 


tpi A 
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If e, is large, &; will be small as compared to Je; and hence the expansion of 


gi 
log y1+ is valid. 
ve, 


.. Assuming e; large, 


log = ~ Dxe + Jfe;E;+ nina : Ag, ae | 


i=] 


k 
ae, 
“4 fo o38 ze] 
k k : . 
Now YEE: = >: {0, -¢:} 
i=l i=l 
= % “ye =0 (1) 


.. Neglecting small quantities y O [| ’ 


i=} 


=> Each €; is distributed as N(0, 1). 


Also &;’s are connected by linear relation (1) 


is distributed as yw? -variate with (k-1) df. 


Conditions for the Application of y’ test 
(i) The members of the sample must be independent. 
(ii) Constraints on the cell-frequencies, if any, should be linear. 
(iii) N, the total frequency must be reasonable large. N should be at least 50, however, 
few the number of cells. 
(iv) No expected or theoretical cell frequency should be less than 5. It is better if it is 
greater than or equal to 10. 
Note. /fany expected cell frequency is less than 5, then to apply W? test this cell is to be 


’ 
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merged with the preceding or s 
adding the cell frequencies of c 
Rules of Decision 

Let P= P( 

For various fixed values of 
of Wo° are tabulated in the fo 
variate is used. Thus role of dec 


Values of y? at specified 


jrom the table. Generally 5% a. 


acceptable at the 5% level of sig 
of significance. 

Alternately, the probability 
and if this is not small the hypo 


Remarks. (1) If y? = 0,0 
i.e., observed and expected frequ 
frequencies differ greatly, y is 


theory and experiment. 
(2) Not only small values o 
near to unity may also lead to a 


(3) y? -test depends only o1 


_ of freedom. It does not make 


y?-variate does not involve any 
test. 


(4) An alternate expressior 


(5) The value y,” is calle 


Uses of w’-test 
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and hence the expansion of 


(1) 


nt. 

uld be linear. 

_Nshould be atleast 50, however, 
Id be less than 5. It is better if it is 


on to apply W test this cell is to be 
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merged with the preceding or succeeding cells so that the new cell frequency (obtained on 
adding the cell frequencies of cells merged) is more than 5. 
Rules of Decision 

Let P= P(w?2wWo) 

For various fixed values of P and for degrees of freedom n ranging from 1 to 30, value 
of Wo are tabulated in the form of y? table. For n> 30, a property that yw? is normal 
variate is used. Thus role of decision is as below : 


Values of w? at specified levels of significance for given degrees of freedom are seen 


from the table. Generally 5% and 1% levels are taken. If y cal ~ 700s , the hypothesis is 
acceptable at the 5% level of significance otherwise non-acceptable. Similarly, for 1% level 
of significance. 

Alternately, the probability P is determined. If this is small, the hypothesis is rejected 
and if this is not small the hypothesis is accepted. 

Remarks. (1) If y* = 0,0; =e; V; 
ie., observed and expected frequencies coincide. On the other hand, if observed and expected 
frequencies differ greatly, w is large. Thus y gives a measure of correspondence between 


theory and experiment. 
(2) Not only small values of P lead us to suspect the hypothesis but the value of P very 
near to unity may also lead to a similar result. 


(3) w? -test depends only on the set of observed and expected frequencies and on degrees 


_ of freedom. It does not make any assumptions regarding the parent population. Since 


y?-variate does not involve any population parameter, this test is known as Non-parametric 


test. 


(4) An alternate expression for y is as below : 


0.-e.)” 
ie 3 = 


ea ** Ye. = x0 
ey sat (- Ze; = Z0;) 


(5) The value Wo is called critical value. 


Uses of ?-test 
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Some of the uses of the y? -test are : 
(i) To test the goodness of fit. 
(ii) To test the independence of attributes. 
(iii) To test for variance of a normal population. 
(iv) To test the homogeneity of several independent estimates of the population variance. 


(v) To test the homogeneity of several independent estimates of, population correlation 
co-efficient. 
(vi) To combine various probabilities obtained from independent experiments to give 
a single test of significance. 
Note. Here only (i) and (ii) will be considered. 


16.2.1. The Test of Goodness of Fit 


One of the principal uses of yw? distribution is to test how well an observed distribution 
fits a theoretical one. When yw? -test is used in this way, it is called the test of “goodness of 
fit”. The expression within inverted commas may be used in two ways. In the first place it 
may describe the “fit” of observed to the hypothetical data. In the second it may be used, 
without reference to a hypothesis, merely to test the merits of a particular formula or a 
particular curve in graduating a set of values or a series of points, e.g., it may be tested how 
well a binomial distribution or normal distribution or Poisson distribution fits the given 
data. The calculations in both the cases are exactly on the same lines. 

Ex. 16-5. /n experiments on pea-breeding, Mendal got the following frequencies of 
seeds : 315 round and yellow; 101 wrinkled and yellow; 108 round and green; 32: wrinkled 
and green. Theory predicts that the frequencies should be in the proportions 9:3:3:1. Test 
the correspondence between theory and experiment. 

Sol. Total frequency = 315 + 101 + 108 + 32 = 556. 


ae 
.. Expected number of round and yellow seeds = ie = 313. 
: 3 
Expected number of wrinkled and yellow seeds = 6 556 = 104. 
oa 3 x 
Expected number of round and green seeds = ie = 104. 


1 
Expected number of wrinkled and green seeds = aa = 35. 


2 _ (313-315) : (101-104)? ‘ (108-104)? é (32-35)? 
313 104 104 35 


= 0-013 +0-087 +0-154+0-257 =0-5. 
Since there are four expected frequencies, number of d.f. 


= 4-1=3. 
From table "00s for 3:'d.f. = 7-815 


Now Weal < Weas : 
-. The difference betweén expected and observed frequencies is not significant at 5% 
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level of significance. 
.. Expériment is in agreemen 
Ex. 16-6. A genetical law say. 
the other parent of blood group N 
N, and that the average numbers 
report on an experiment states as , 
parent, 28-4% were found to be of 
Do the data in the report conform 
Sol. Total freq. = 162. 
Observed frequencies are 
oe 6 ad 46, +2 .162 = 
100 100 
and expected frequencies are 


1 62 = 40-5,2.162 =81: 
4 4 


yv 


No. of d. 
Now . Woos” for 2d. 


V cal 
.. Hypothesis may be correct 
Ex. 16-7. 300 digits were chose 
Digit 0 1 2 
Freq. 18: 32 28 
Test the hypothesis that the di 
which the data were collected. 
Sol. On the assumption that di; 
frequency of each class 


300 _ 
10 


1 
2 = —{(E 
y 30 
= 31-3 
No. of df= 10-1=9 
From tables, wi; for 9 d.f. : 


DS ¢! 2 
Veal = Vo.0s 

.. Assumption is wrong. 

Ex. 16-8. 200 digits were chi 
digits were : : 
Digits 0 1 . 2 
Freq. 18 19 2304 | 


Use wy? test to assess the cor 
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ites of the population variance. 


ates of population correlation 


lependent experiments to give 


v well an observed distribution 
called the test of “goodness of 
two ways. In the first place it 
In the second it may be used, 
3 of a particular formula or a 
ints, e.g., it may be tested how 
son distribution fits the given 
me lines. 
t the following frequencies of 
round and green; 32-wrinkled 
1 the proportions 9:3:3:1. Test 


56 = 313. 
56 = 104. 
56 = 104. 


56 = 35. 


108-104)? i (32-35) 
104 35 


=0:5. 
Jf. 


uencies is not significant at 5% 
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level of significance. 

.. Experiment is in agreement with the theory. 

Ex. 16-6. A genetical law says that children having one parent of blood group M and 
the other parent of blood group N will always be one of the three blood groups M, MN and 
N, and that the average numbers of children in these groups will be in the ratio 1:2:1: The 
report on an experiment states as follows of 162 children having one M parent and one N 
parent, 28-4% were found to be of group M, 42% of group MN and the rest of the Soup N. 
Do the data in the report conform to the expected genetic ratio 1:2:]? 

Sol. Total freq. = 162. 

Observed frequencies are 


ees as re 162 = 68 and 162 —46- 68 = 48 
100 100 


and expected frequencies are 


7162 = 40-5, 2-162 ~8land 40-5 


ke 2 2 2 
w (5-5) G3) 40 5) 
40-5 81 40-5 


=~ 4-2. 
No. of d.f. = 3-1=2. 
Now . Voos” for 2d.f. = 5-99 


Weal Woos- 
*. Hypothesis may be correct and hence genetical law appears to be correct. 

Ex. 16-7. 300 digits were chosen at random and found to give the following distribution : 
Digit 0 1 “2 3 4 5 6 7 8 9 
Freq. 18 32 28 34 42 50 17 23 27 29 

Test the hypothesis that the digits were distributed in equal numbers in the table from 
which the data were collected. 

Sol. On the assumption that digits are distributed in equal numbers in the table, expected 
frequency of each class 


y? = (a2? $27 4.9? 44? 4(12)7 +20)? 4-013)? £77 43" 4:17} 
~ 31-3 
No. of df = 10-1=9 
From tables, woos for 9 d.f. = 16-92 
Weu > Voos 


-. Assumption is wrong. 
Ex. 16-8. 200 digits were chosen at random from a set of tables. Ti he Srequencies of 


digits were : 


Digits 0 1 2 3 4 5 6 7 8 9 
Freq. 18 19 23 21 16 25 22 20 21 15 


Use yy? test to assess the correctness of hypothesis that the digits were distributed in 
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equal numbers in the table. Given that the values of yw? are respectively 16-9, 18:3 and 


19-7 for 9, 10 and 11 degrees of freedom at 5% level of significance. 

Sol. Set the hypothesis ‘The digits were distributed in equal numbers in the table’. Then 
expected frequency of each digit 
200 = 20 

10 
25 hig 14941416425444041425} 
v 20 


Beater ee 


No. of d.f. = 10-1=9 
Now y7o.05 for 9 d.f. = 16-9 


Weal < woos 
.. Data is consistent with the hypothesis and hence the hypothesis may be correct. 
Ex. 16-9. In 120 throws of a single die, the following distribution of faces were obtained : 
Faces 1 2 3 4 5 6 Total 
Freq. 30 25 18 10 22 15 120 
Test whether these results constitute a refutation of the ‘equal probability’ hypothesis. 
Sol. Set the ‘equal probability’ hypothesis. Then expected frequency of each face 


~ 29. 
6 
y? = 5p {100+25+4+100+4425} =12-9 
No. of d.f. = 6—1=5- 
woos = 11:07 


Weal > woos 
“. The hypothesis is wrong. 
Ex. 16-10. The following figures show the distribution of digits in numbers chosen at 
random from a telephone directory : 
Digit 0 1 2 3 4 5 6 7 8 9 
Freq. 1026 1107 997 966 1075 933 1107 972 964 853 = 10,000 
Test whether the digits may be taken to occur equally frequently in the directory. 
Sol. Set the hypothesis ‘Digits occur equally frequently in the directory’. Then caprricd 
frequency of each digit 
10,000 


= ——— = 1,000 
10 


y? = ap 28” +(107)? + (3)? +(34)? + (75)? +(67)? 


+(107)? +(28)* +(36)* +(47)7} ~ 39-142 
No. of df = 10-1=9 
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Now from table, woos for 9 


Weal > woos 

. Hypothesis is certainly wr 
frequently in the directory. 

Ex. 16-11. /n the construction 
Jrom some logarithm tables and th. 
Digit 0 1 2 
Freq. 1439 1441 1461 14 


Use y?-test to assess the cor: 


chance of being chosen. 
Sol. Assuming that each digit h 
of each digit 


< 
It 


No. of df = 10-1=9 
Now woos for 9 d. f.= 16-92 


Weal > woos 
.. Hypothesis is wrong. 
Ex. 16-12. Five dice were thr 
thrown are given below : 
No. of dice showing 5 
4,5o0r6 
Freq. 8 


Calculate wy? 
Sol. The probability of getting 


“. By B.D., the expected freque 
of 


aes + 
2 


.. Expected frequencies are 
3, 15, 30, 30, 15,3 
Since the border frequencies < 
ones. Doing so 


Observed freq. 26 
Expected freq. 18 
a 64 + 25 
vo 18 30 


Ex. 16-13. Twelve dice were tl 
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2 are respectively 16:9, 18:3 and Now from table, woos for 9 d.f. = 16.92 


ignificance. 
desde numbers in the table’. Then *. Yea > Wo0s 
.. Hypothesis is certainly wrong and hence digits can’t be taken to occur equally 
frequently in the directory. 
Ex. 16-11. /n the construction of a table of random numbers, 15,000 digits were taken 
from some logarithm tables and the numbers of each digit obtained were as follows: 
Digit 0 1 2 3 4 5 6 7 8 9 
+0+1+25} | 


| Freq. 1439 1441 1461 1452 1494 1454 1613 1491 1482 1519 


Use w?-test to assess the correctness of the hypothesis that each digit had an equal 
chance of being chosen. 
Sol. Assuming that each digit had an equal chance of being chosen, expected frequency 


of each digit 
= 1500 
y? = 61)? + (59)? +(39)? +(48)? +6? + (46)? +(113)? 
the hypothesis may be correct. 1500 
distribution of faces were oe : ee ee 
Tot 
e - oi No. ofdf'=10-1=9 

“the ‘equal probability’ hypothesis. Now w2o05 for 9 d. f.=1692 


cpected frequency of each face 
Weal > Wo0s 
.. Hypothesis is wrong. 
Ex. 16-12. Five dice were thrown 96 times and the number of times 4, 5 or 6 was 
thrown are given below : 
No. of dice showing 5 4 3 2 1 0 
{ 4, Sor6 
Freq. 8 18 35 24 10 1 
Calculate y? 
Sol. The probability of getting a 4, 5 or 6 in a throw of a single die 
ai 
2 
.. By B.D., the expected frequencies are the successive terms in the binomial expansion 


5 
od + +] 
22 


. Expected frequencies are 
3, 15, 30, 30, 15, 3 


Since the border frequencies are small, these are to be combined with the adjacent 
ones. Doing so . 


25} = 12-9 


ition of digits in numbers chosen at 


6 7 8 9 
107 972 964 853= 10,000 
ally frequently in the directory. . 
mtly in the directory’. Then capwied 


of 


” +(34)? +(75)? +(67)” Observed freq. 26 35 24 1 
Expected freq. 18 30 . 30 18 
47)"} ~ 39-142 : 
en 64 ae 25 36 + 49 ~§8-3]1 


+ 
18 30 30 18 ~ 
Ex. 16-13. Twelve dice were thrown 4096 times and a throw of 6 was reckoned as a 
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success; the observed frequencies are given below : 
No. of successes 0 1 2 . 3 4 5 6 7. 
Freq. 447 1145 1181 796 380 115 24 8 


Find the value of wy? on the hypothesis that dice were unbiased and hence show that 


and over 


the data are consistent with the hypothesis so far as the y?-test is concerned. 


Sol. On the hypothesis of unbiased dice the theoretical frequencies are the successive 
terms in the binomial expansion of 
12 
4096 (2 + | 
6 6 


1 
as the probability of success with a throw of one die is P34 


“. Expected frequencies are 
459; 1102; 1212; 808; 364; 116; 27 and 8. 
2 (12)? (43)? (31)? (12)? ne 


459 1102 1212 808 364 116 27 8 
= 4:00 


(16)" P 3 , 6-8" 


No. of df = 8-1=7 


Now Woos? for 7 d.f.=14-07 


“e Wal < 7005 
. The data is consistent with the hypothesis. 
Ex. 16-14. A set of 6 similar coins is tossed 640 times with the following result : 


No. of heads 0 1 2 3 4 5 6 

Freq. 7 64 140 210 132 75 12 

Calculate the binomial frequencies on the assumption that the coins are symmetrical 
and test the hypothesis. _ 


Sol. On the assumption that coins are unbiased, the expected frequencies are given by 
the successive terms in the binomial expansion of 


65 654 6543 6.5.4.3.2 ef 
5.4.3.2.1 


=10}1+6+—+ +——— 
2 32 43.21 


“. Expected frequencies are : 
10, ss 150, 200, 150, 60, 10 


Bice (10)? (10)? 8)" (15)? 
YWor= iS + + 
10 Bs 150 200. 150 60 7 
No. of d.f. = 7-1=6 
Now w7o0s for 6 d.f. = 12:59 


(eg) cineso | a 
640/—+—] =100+1) 
2° 2 . 


Weal < 7005 
.. Assumption may be correct. 
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Ex. 16-15. /2 dice were rollea 
had 5 or 6 on the uppermost face ' 
following table : 


No. of dice 0 1 
showing 5 or 6 
Freq. 185 1149 3 
9 10 
105 14 


Fit a binomial dist. and test for 
Sol. From the data, 


M. = Seay tl N49 + 653041 
+9 

_ 106602 

~ 26306 


Let p be the probability of occu 
binomial distribution mean = np, est 


_ 106602 
26306 
whos n=no. of dice = 12. 
ae 0:3377 
=1-p=0- 
ixpesiea Rees are suc 
2630 
.. Expected frequencies are : 
187, 1146, 3215, 5465, 6269 
Since expected frequencies of la: 
merged. 


2-2 3 
187 11 

i’ 

++ —. 

133 


Now since mean and total freque 
frequencies, 


No. 
Now Vos 
. Fit is good. 
Ex. 16-16. The following data si 
during 14 years. 
No. of suicides in 0 1 
a state per year 
Freq. 364 376 


5 
115 


were unbiased and hence show that 
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6 7. and over 


24 8 


le w?-test is concerned. 


etical frequencies are the successive 


32 _R)2 
12)" dey V3 By 


ae ae 


=7 
= 14-07 


< 700s 


6 2 8 


) times with the following result : 


3 
210 


imption that the coins are symmetrical 


, the expected frequencies are given by. , 


rel +1)° 


43 65 


4 
132 75 12 


432 +1) 


21 543.21 


30, 60, 10 


P08)" 
0. 150 60 10 


2 
"0.05 


2 2 
as 2 . 8-6 
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Ex. 16-15. /2 dice were rolled 26303 times and each time the number of dice which 
had 5 or 6 on the uppermost face was recorded. The results are given in the form of the 
following table : 


No. of dice 0 1 7 2 4 5 6 7 8 
showing 5 or 6 
Freq. 185 1149 3265 5475 6114 5194 3067 1331 403 


9 10 11° (12 
105 14 4 — 
Fit a binomial dist. and test for goodness of fit. 
Sol. From the data, 


AM. = saat 149 + 6530+ 16425 + 24456 + 25970 +18402 
+9317 +3224 +9454140+44} 
_ 106602 


26306 


Let p be the probability of occurrence of 5 or 6 in a throw of single die. Then since for 
binomial distribution mean = np, estimate of p is given by 


_ 106602 
"P ~ "26306 
where n=no. of dice = 12, 
he 0:3377 
= 1-p=0-6623 


6 Eapeeed aaa are successive terms in the binomial expansion of 
26306 (0:6623 + 0-3377)!? 
.. Expected frequencies are : 
187, 1146, 3215, 5465, 6269, 5115, 3043, 1330, 424, 96, 15, 1, 0. 
Since expected frequencies of last two classes are less than 5, last three classes are to be 
merged. 


> 2? 3% (50)? (10)? (155)?. (79)? (24)? 
Wo = sotto tee tt ot et ee tt 
187 1146 3215 5465 6269 5115 3043 


‘ 1? 2, _ 8- 16)? 
1330 424.96. +16 


Now since mean and total frequency have been used from the data to obtain expected 
frequencies, 


~ 8-201 


No. of df. = 11-2 =9 
Now Woos for 9 d.f.= 16-92 
Weal < wo0s 


.. Fitis good. 
Ex. 16-16. The following data shows the suicides of 1096 women in 8 Punjab cities 


during 14 years. 


No. of suicides in 0 1 2 3 4 5 6 7 
a state per year 


Freq. 364 376 218 89 33 13 2 1 
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Fit a Poisson distribution to the data and show that the fit is not good. ers = 0-3075). 
Sol. The parameter m of the Poisson distribution is to be obtained from the data itself. 
Since it is equal to the mean of distribution, we have 


m= | _ 0(364) +1(376) + 2(218) + 3(89) + 4(33) + 5(13) + 6(2) + 701) ~ 1-18 


1096 
.. The theoretical frequencies are 
1 Qer 
1096.e7 M8 Ot eeeeead 
x! 
ie., 337, 398, 235, 92, 27, 6, 1,0 
2 52 2. 42 ¢2 _ 2 
yw = (C0 a C7) es CD eg nC ~17-6 


337-398) )=— 235 92s 27 7 
merging last three classes as the expected frequencies of last two classes are less than 5. 
Here no. of classes = 6 (as last three classes have been merged) 
“. No. of d.f. = 6 — 2 = 4 (as mean and total freq. are kept same for expected and 


observed frequencies). 


Now woos for 4d.f.=9-49 


yal > y7o0s 
. Fit is not good. 
Ex. 16-17. May the following set of observations be regarded as those of a random 


sample from a Poissonian distribution, given e° > = 61. 
Deaths : 0) . ol 2 3 4 Total 


Freq. : 122 60 15 2 1 200 
Sol. As in Ex. 10-47, theoretical frequencies are 
122 61 15 2 0 


As expected frequencies of last two cells are less than 5. These cells are to be merged 
with preceding one 
Thus we have 


O; : 122 60 — 18 
e; : 122 61 17 
2 2 2 
2 _ (22-122) (60-61)° (18-17) 
122 61 17 
= oe ge ~ 0-08 
61 17 
No. of fi =3-2=1 
Yoos = 3-84 


2 2 
Wcal < W005 


. Given set of observations can be regarded as those of a random sample froma PD. _ | 


Ex. 16-18. Fit a normal distribution to the data given below and test the goodness 

fit: . 
Height (inches) 60-62 63-65 66-68 69-71 72-74 

| Freq. 5 18 42 27 8 
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Sol. The A.M. mand s.d-c of 
respectively. 


The calculations are arranged 


Class 
Heights | boundaries 
(X) 


In 2nd column class bound: 
ze X — 67-45 
2-92 

the various values of Z are written, ] 
obtained by subtracting the success’ 
have the same sign and adding therr 
in the table above). In 6th column 
entries in 5th column by total frequ 


ces 
i. 413 


are written and in: 


Since mean, s.d. and total frequ 
frequencies, number of d. f.= 5-3 


Now y 
“. Fit is good. 


1, Ina sample of peas from coffee 
of angular peas is 101. Is this 
ratio in which they should occu 


2. Ina Mendalian experiment on pe 
to occur in the proportion 9 : 3: 
observed frequencies were respi 
results correspond with the thec 
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tthe fit is not good. (e!8 =0 3075). 
is to be obtained from the data itself. 


33) + 5(13) + 6(2) +70) = 1-18 


es of last two classes are less than 5. 
ve been merged) 


_freq. are kept same for expected and 


= 9.49 


“0.05 


ions be regarded as those of a random 


=-61. : 
3 4 Total 
ire 
2 0 


less than 5. These cells are to be merged 


J 70.05 


das those of a random sample froma 


data given below and test the good 


66-68 69-71 72-4 
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Sol. The A.M. m and s.d. of the given data can be easily shown to be 67-45” and 2:92" 
respectively. 
The calculations are arranged in the table below : 


Class Areas under 
Heights | boundaries normal curve | Areas for | Expected | Observed 


(X) from 0 to Z each class Freq. Freq. 


In 2nd column class boundaries (X) are written, in 3rd column the values of 
_ X-67-45 

~ 2.92 
the various values of Z are written. In Sth column areas for each class are written. These are 
obtained by subtracting the successive areas in the 4th column when the corresponding Z’s 
have the same sign and adding them when Z’s have opposite signs (which occurs only once 


in the table above). In 6th column expected frequencies are written by multiplying the 
entries in 5th column by total frequency 100. 


2 _ (5-4-13) 18-20-68)" (42-38-92)? 
4-13 20-68 38-92 


are written and in 4th column areas under the normal curve from Z = 0 to 


27-71 7:43 0° 7 
Since mean, s.d. and total frequency have been used from the data to obtain expected 


0-84 


frequencies, number of d. f.= 5-3 = 2, 
Now woos for 2 d.f.=5-99 
2 2 
oe Yocal < W005 
.. Fit is good. 
EXERCISES 


1. Ina sample of peas from coffee plants the number of round peas is 336 and the number 
of angular peas is 101. Is this in agreement with the Mendalian hypothesis that the 
ratio in which they should occur is 3 : 1? 


[Ans. y? = 0-8] 
2. Ina Mendalian experiment on pea-breeding the four possible seed varieties are expected 
to occur in the proportion 9 : 3 : 3: 1. In one experiment involving 720 trials the actual 


observed frequencies were respectively 396, 139, 129 and 56. Examine whether these 
results correspond with the theory. 


[Ans. y* = 3-27] 


10. 
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. Genetic theory states that children having one parent of blood type M and the other of 


blood type N will always be one of the three types M,MN,N and the proportions of 


three types will on average be 1 : 2: 1. A report states that out of 300 children having ~ 


one M parent and one N parent, 30% were found to be of type M, 45% type MN and 
the remainder type N. Test the hypothesis by w? — test. 


[Ans. y? = 4.5, Hypothesis may be correct] 


. Find the value of y? for the following table : 


Class A B Cc D E 
Observed 8 29 44 15 4 
Expected freq. 7 24 38 24 7 
[Ans. 6-8] 
. Find the value of y? for the following table : 
Class A B Cc D E 
Observed freq. 8 29 47 16 4 
Expected freq. 7 24 38 24 7 
[Ans. 7-3] 


. The following table gives the number of aircraft accidents that occurred during the 


various days of the week. Find whether the accidents are uniformly distributed over the 
week. 


Days Sun Mon Tue Wed Thu _ Fri Sat Total 


No. of accidents 14 16 8 12 11 9 14 84 


[Ans. Uniformly distributed] 


. Five coins are tossed 320 times and the following results are obtained. 


No. of heads 0 1 2 3 4 5 
Freq. 8 57 110 90 0 5 
Test the hypothesis that coins are unbiased. ' 


. 12 dice were rolled 4,096 times and a throw of 4, 5 or 6 is reckoned as a success, the 


observed frequencies are given below : 
No. of successes 0 1 2 3 4 5 6 7 8 
Freq. 0 7 60 198 430 731 948 847 536 
9 10 11 12 
25771 11 0 


Apply ?-test to test whether dice can be regarded as unbiased. 
[Ans. Dice can’t be regarded as unbiased] 


. Five dice were thrown 192 times and the number of times 3, 4 or 5 were thrown are 


given below : 

No. of dice throwing 5 4 3 2 ‘1 0 
3,4o0r5 : 

Freq. 6 46 70 48 20 3 

Calculate y? [Ans. 16°6] 


The following is the distribution of 106 eight pig litters according to the number of 


males in the litters : 
No. of males 0 1 2 3 4 5. 6 7 8 
No. of litters 6 5 8 22 23 25 12 1 4 


Total 
=106 
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11. 


12. 


13. 


14, 


15. 


16. 


Fit a binomial distribution und 


goodness of fit. (wos for 8c 


Records taken of the number c 
No. of male births ] 
0 
1 
2 
3 
4 


Test whether the data are cons 


and that the chance ofa male bi 


You may use the table given be 
D.F. 1 


5% value of y? 3-84 
One hundred and ninety-two f 
child being born ic otherwise 
among the first three children. 
No. of albinos . 0 

No. of families 77 
Find the expected frequencies : 


being born an albino and test tl 


Fit binomial distribution to the 
x: 0 1 2 3 
f: 3 8 11 15 
In 1,000 extensive sets of trials 
the number x of successes are f 
x: 0 1 Z 
f: 305 365 21 
Fit a Poisson distribution to the 
A systematic sample of 100 pas 
the observed frequency distrib 
follows : 
No. of foreign 0 
words per page 

Freq. 48 y 
Graduate the data by a Poisson 


by w?-test. 

The table below gives the numbe 
of 584 pages : : 
Mistakes per page 0 1 
No. of pages. 238 = 20 
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nt of blood type M and the other of 
»5 M.MN,N and the proportions of 
ites that out of 300 children having © 
o be of type M, 45% type MN and 


- test. 


= 4.5, Hypothesis may be correct] 


D E 
15 4 
24 7 
[Ans. 6-8] 
D E 
16 4 
24 7 
[Ans. 7:3] 


accidents that occurred during the 
its are uniformly distributed over the 


d Th Fn Sat Total 
11 9 14 84 
[Ans. Uniformly distributed] 
‘results are obtained. 
3 4 5 
90 0 5 


, 


, 5 or 6 is reckoned as a success, the 


4 5 6 7 8 
430 731 948 847 536 


led as unbiased. 


Dice can’t be regarded as unbiased] 
1 of times 3, 4 or 5 were thrown are 


3 2 1 0 
10 48 20 2 
[Ans. 16°6] 


ig litters according to the number of : 


Pe ce 
ectch cr tet 
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Fit a binomial distribution under the hypothesis that the sex ratio is 1 : 1 and test the 
goodness of fit. (wo.0s for 8d. f.=15-51). 


11. Records taken of the number of male and female births in 800 families. 


No. of male births No. of female births No. of families 


0 4 32 
1 3 178 
2 2 290 
3 1 236 
4 0 64 
800 


Test whether the data are consistent with the hypothesis that the binomial law holds 
and that the chance ofa male birth is equal to that of a female birth, namely q = p= ey, 
You may use the table given below : 

DF. 1 2 3 4 5 

5% value of y? 3-84 5:99 7-82 9-49 11:07 


[Ans. Binomial law does not hold] 


12. One hundred and ninety-two families (for each of which the possibility of an albino 


child being born ic otherwise established) had the following distribution of’albinos 
among the first three children. 

No. of albinos 0 1 2 3 Total 

No. of families 77 90 20 5 192 

Find the expected frequencies on the basis of a theoretical probability 0-25 of a child 
being born an albino and test the goodness of fit. 


rer Fit is good] 


13. Fit binomial distribution to the following data and test the goodness of fit : 


x: 0 1 2 3 4 5 6 7 8 9 =Total 
f: 3 8 11 15 16 14 12 11 9 1 100 


14. In 1,000 extensive sets of trials for an event of small probability the frequencies ‘f’ of 


the number x of successes are found to be 

x 0 1 2 3 4 5 6 7 
f: 305 365 210 80 «+ 28 9 2 1 
Fit a Poisson distribution to the data and test the goodness of fit. 


15. A systematic sample of 100 pages was taken from the Concise Oxford Dictionary and 


the observed frequency distribution of foreign words per page was found to be as 
follows : 


No. of foreign 0 1 2 3 4 5: 6 
words per page ; 
Freq. 48 27 12 7 4 1 7 


Graduate the data by a Poisson distribution and judge the goodness of your graduation 


by y?-test. 


16. The table below gives the number of mistakes committed per page in typing a manuscript 


of 584 pages : 
Mistakes per page 0 1 2 3 4 5 6 7 and above 
No. of pages 238 208 97 30 9 0 2 0 
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Graduate the data by a Poisson distribution and test the goodness of fit. Present your 
results in a tabular form. 

[Below are given values of y? with probability P of being exceeded in random 
sampling; n being the number of degrees of freedom : 


16.2.2. Test of Independence of Attributes 


Consider for example the attribute-heights of individuals. Then it may be divided into a 
large number of parts, e.g., very-tall, tall, medium-sized, short and very short. Thus, the 


given attribute A can be divided into a number of classes A,, A3,.....,A,. Similarly any other 
given attribute B can be divided into classes B,, By,.....,B,. Evidently when both attributes 
A and B are taken into account each one of the classes A,, A3....,A, would be divided into 


a large number of subclasses according to B,, B,,...B,. Such a classification is called 
manifold classification and a table of the following type is obtained. 


Attributes A 
A Ag eeseessaee Ajessesseees A Total 

B On Ojgeerereeees Oj jeservscens Or, (B,) 
B, 0, Oy3 .ecesccees Oyj scerssrees 03, (B,) 

B : : 
B, 0; Ojnsreevisees 0; widssuieds 01 (B;) 
B, 0s, 0,2 deeeecceee Oy PTY Y TTT TT Ou, (B, ) ‘ 

Total (A;) (Ag) ercceeees (A;): saadeaues (A;) N 


Such a table is called sx + contingency table. Here N is the total frequency, O; , is the 


frequency of (i, j)th ceil (i.e., a place common to 7th row and jth column), 


(B,), (Bg), .00 (B,) are totals of rows and (A;),(Az),..0 ,(A,) are column totals, 
Evidently 
N =(A,)+(A,)+...4+(A,) = (B,) + (Bz )+...+(B,) ...(1) 


To test whether there is any. relationship between 4 and B, the independence of two. 


attributes is assumed (Null hypothesis). On the basis of this hypothesis expected frequencies 
of various cells are obtained by keeping the row and the column totals for expected frequencies 
same as for observed frequencies. 
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Now proportion of individual: 


Since A has no influence on B 
the classes (A, ), (A ),....00006 ,(A,) 


. Expected number of indivic 


Knowing expected frequencie: 

No. of degrees of freedom as: 

There are in all s.t, cells. Sincx 
observed frequencies, there are (s - 
independent linear constraints 

: v = The no. of d:: 


This is the number of cells wh 

Ex. 16-19, An opinion poll w 
reform in 100 members of each of 
below : 


Favourable 
Party A 40 
Party B 42 


Test for independence of react 
2 df. = 5-99), 

Sol. Assuming the independen 
expected frequencies is as below : 


Favourable 
82 x L00 
Party A —~—— = 41 
arty 200 
82 x100 
Party B = 4] 
Mack 200 
w 
No. of d.f. = (2—1)(3-1) =2. 
Now Voos for 2d. f. 
Wi 


.. Hypothesis of independence 
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the goodness of fit. Present your 


2 of being exceeded in random 


qals. Then it may be divided into a 
|, short and very short. Thus, the 


4,, Ags-++4,- Similarly any other 
3,. Evidently when both attributes 
11 Ag-+94, would be divided into 


,. Such a classification is called 
>is obtained. 


\ 

A jessssseees A, Total 
eee (B,) 
Yj vesneeenee 02, (B,) 

D psoantass 0; (B,) 
Disrsysit Ow (B,) 
AJ eietieeAy) N 


e Nis the total frequency, O,, is the 
n to ith row and jth column), 


Sere (A,) are column totals, 


. (1) 
A and B, the independence of two. 
his hypothesis expected frequencies 
ylumn totals for expected frequencies 


| 
| 
4 
q 
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Now proportion of individuals belonging to class B; in the entire data 


ABD 
ae 
Since A has no influence on B, this proportionality is expected to be maintained in all 


the classes (A, ), (Az ),...++0000 s(A,). 
.. Expected number of individuals belonging to (i, j) th cell 


_ (AB) 
N 


Knowing expected frequencies independence is tested by applying y?-test as usual. 

No. of degrees of freedom associated with a sxt contingency table. 

There are in all s.t. cells. Since row and column totals are kept same for expected and 
observed frequencies, there are (s+) constraints. Because of (1) there are only (s+t—-1) 
independent linear constraints 

v = The no. of df. = s.t.-— (s+t-1) 


(s—1)(¢-1). 

This is the number of cells whose frequencies can be arbitrarily assigned. 

Ex. 16-19. An opinion poll was conducted to find the reaction to a proposed civic 
reform in 100 members of each of the two political parties. The information is tabulated 
below : 


tt 


Favourable Unfavourable Indifferent 
Party A 40 30 30 
Party B 42 28 30 
Test for independence of reactions with the party affiliations (Given that y70.05 for 


2 d.f. = 5-99). 
Sol. Assuming the independence of reactions with the party affiliations, the table of 
expected frequencies is as below : 


Favourable Unfavourable Indifferent 
82 x 100 58 x 100 60 x 100 
—=4] = 29 _ 30. 
ee 200 200 200 
82 x 100 58 x 100 60 x 100 
= 4] = 29 = 30. 
Raye 200 200 200 


po Nee Ee be: eee: AAO 
= +445 5+ = +011 


ye = t+ tt = ~ 
41 41 29 29 41 29 1189 


No. of d.f. = (2—1)(3-1) =2. 
Now Waos for 2d. f. = 5:99 


2 2 
Vea < Vo.os 
.. Hypothesis of independence of reactions with the party affiliations may be correct. 
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ee 


Ex. 16-20. The following table shows the result of inoculation against cholera : 


Not-attacked Attacked 
Inoculated 431 5 
Not-inoculated 291 9 


Is there any significant association between inoculation and attack ? Given that 
yap {P= 0-074 for y* =3-2 
P=0-069 for wy? =3-3 


. Sol. Assuming the independence between inoculation and attack, erpested frequency 
table is : 


Not-attacked Attacked 
722 x 436 14x 436 
—— = 427-7 ——_—— = 8.3 
Inoculated 736 . 736 
722 x 300 14x 300 
-j Soo 90453 ee 
Not-inoculated 736 | 736 


: I 1 1 1 
te (33 4a oe es hag 
ye Hor 8-3 294.3 +3} 


v=No. of df = (2-1) (2-l=1 
Now for y?=3-2, P=0-074 
Now for y* =3-3,  . P=0-069 
Now when, w increases by 0-1, P decreases by 0-005 


0-005 
. When wy? increases by 0-08, P decreases by x 0-08 
= -0040. 
.. For y* = 3-28, P= 0-074-0-004 = 0-07. 


Thus, if the hypothesis is true, the data gave results which would be obtained about 7 
times in hundred trials. This is infrequent but not very infrequent. Moreover, the theoretical 


frequencies in the ‘attacked’ column are not very large. It will, therefore, be unjustified in < 


rejecting the hypothesis but it can be said that data lead us somewhat to believe that hypothesis 
is not correct i.e., inoculation and attack are associated. 
Ex, 16-21. From the following table 


Eye colour in sons’ 


Not light Light 
Not light 230 - 148 
Eye colour in fathers Al 
Light 151 471 


test the association between the eye colours of fathers and sons. 
Sol. Assuming that there is no association, the expected frequency table is 
Eye colour in sons 


Not light Light 
Not light 144- 234 
Eye colour in fathers ~ 
Light 237 385 
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No. of d.f.= (2-1) (2-l= 


Boe 
Yoos = 3°84 
2 
Wo 
a Assumption is wrong. 
Ex. 16-22. /n an experiment | 
results were obtained : 


Inoculated 

Not inoculated 

Examine the effect of vaccine 
Sol. Assuming that vaccine h 


Inoculated 
. Not inoculated 


W 17 

= 25 Es 

17 

No. of d.f. = (2-1) 
woos = 3-84 


y"oos > yo0s 

“. Assumption is wrong. 
Ex. 16-23. Examine by any s 
voting preference in the election fi 


Vote for —> A 
Area 

Rural 620 
Urban 380 
Total 1,000 


Sol. Assuming the independe 
table of expected frequencies is : 


Vote for —> A 
Area 

L 
Rural 550 


Urban 450 
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Attacked 


14 x 436 
736 


14 x 300 
736 


aoe sgrgt eg }o378 
8.32943. 5-7 


I)(2-H=1 


=8-3 


=5-7 
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* x0-08 


:0-07. 
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CHI-SQUARE DISTRIBUTION - , 623 


2 _ (230- 144)? , M48- 234)? ,Usl- 237) pas 385)" 
y 


~ 133. 


No. of d.f. = (2-D(2-D=1 


Woos = 3:84 


2 2 
Vea > Vo.os 
.. Assumption is wrong. 
Ex. 16-22. /n an experiment on immunization of cattle from tuberculosis the following 


results were obtained : 


Affected Unaffected 
Inoculated 12 28 
Not inoculated 13 7 


Examine the effect of vaccine in controlling the incidence of the disease. 
Sol. Assuming that vaccine has no effect on disease, expected frequency table is, 


Affected Unaffected 
Inoculated _ 17 23 
Not inoculated 8 12 
2_ (17- 12)” 3 28) Peles 13)? , (2 7)" 
v 17 3 8 12 
Ld ee rae ~7-8. 
17 23 8 12 
No. of c.f. = (2-lQ-D=1 


y"o0s = 3-84 


wos > W005 
*, Assumption is wrong. 
Ex. 16-23. Examine by any suitable method, whether the nature of area is elated to 


voting preference in the election for which the data are tabulated below : 


Vote for as A B Total 
Area os 

Rural ; 620 480 1,100 
Urban 380 520 900 
Total 1,000 1,000 2,000 


(Woos” for 1d. f.=3-84) 
Sol. Assuming the independence of voting preference and the nature of the area, the 


table of expected frequencies is : 


Vote for > A B 
Area 
L . 
Rural 550 550 J 
Urban 450 450 
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Pn FOF IO? , TOF, 100" 99.5 ys forte 


“. Assumption is wrong. 

Ex. 16-24. An investigator into chocolate consumption divided India into eight areas 
and took a random sample from each, the individuals so obtained being classified as 
consumers or non-consumers of chocolate. His results were as follows : 


Area number : 1 2 3 4 5 6 7 8 Total 
Consumers : 56 87 142 71 88 72 100 142 758 
Non-consumers: 17 20 58 20 31 23 25 48 242 
Total 73 107 200 91 119 95 125 190 1,000 


Do these results suggest that the consumption of chocolate varies from place-to-place. 
Sol. On the assumption that areas and chocolate consumption are independent i.e., 
chocolate consumption does not vary from place-to-place, the expected frequency table is 


Area number : 1 2 3 4 5 6 7 8 

Consumers : 55 81 152 69 90 72 95 144 

Non-consumers: 18 26 48 22 29 23 30 46 
wy? = 628 


No. of d.f. = (2-1) (8-1) =7 
From tables, woos for 7 d. f.= 14-07 
Weal $ W005 
“, Assumption may be correct. 
Ex. 16-25. Deduce that for a s x t contingency table y? < N(s-1) or y? <$ Nt-1) 
whichever is less. 


Sol. Let ¢ be the expected frequency of (i,/)th cell. 


(A, )(B; 
Then ee ABT 
N 
s t 2 2 
(0; ~ ey) 0; 
Now yw? = pO eae) Pa pre kd 
i=l jal i i fl 


li i 
-M 
ay, ._M 
~M = gto 
oo 
S\e 
Sle 
Zle 
=| 
sls 
> 
+ 
= 


Now since 0; $(A;), 


| 
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= N(s 
Similarly y? < N(t-1), 
w? <min. [NC 


Co-efficient of Contingency 
The co-efficient of conting: 


f 


where N = tota 
Yates Correction of Cont 


of w? as below : 


Jn general, correction is m 
large samples this yields practi 


can arise near critical values. 
Ex. 16-26. Show that in a 


y? calculated from the hypot 


a’ Ce 


Sol. Let Ur Nae be the expe 


Then a= ce 
ae 

(a: 

ae 
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-o obtained being classified as 
re as follows : 
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72 100 142 758 : . . » 4 = B; 
23 25 48 242 
95 125 190 1,000 N(s~-1). 
olate varies from place-to-place. Saitarlyg? SNOSD: 
msumption are independent 7.e., 
, the expected frequency table is 


y? <min.[M(s—1), N(t-D). 


7 8 
Pi Es 95 144 Co-efficient of Contingency 
29 23 30 46 The co-efficient of contingency (C) is given by 
ae N+ y? 
where ‘N = total freq. 
) Yates Correction of Continuity. This correction consists in modifying the definition 


| of w? as below : 


> yw? sN(s—l) or y? <N(t-) oar yl ~e,|-0-5}? 


WV a (a ie 


t 
In general, correction is made only when the number of degrees of freedom is v=1. For 
large samples this yields practically the same results as the uncorrected y? , but difficulties 


can arise near critical values. 


ab 
Ex. 16-26. Show that in a 2x2 contingency table where in the frequencies are “| 7’ 


yw calculated from the hypothesis of independence is 


(at+b+c+d) (ad — be)? 
(a+b)(c+d)(at+c)(b+d) 


t f 


Sol. Let Gi} be the expected frequencies obtained on the hypothesis of independence. 
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Similarly (b-b')* =(c-c')* =(d-d') __(ad~be)" _ 
(a+b+c+d)* 
ee ppc (ad - bc) a 
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(at+b)(a+c)(b+d) (a+c)(b+d)(c+d) 
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Ex. 16-27. Show that for a 2 x n contingency table, 
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where {11,,}2, are the two frequencies in the rth column and N,,N, are the marginal 
sums of the two rows. 


‘Sol. Let Tir and "2, be the expected frequencies in 7th column. 
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Ex. 16-28. Show that for ent. 
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Ex. 16-28. Show that for entries in 2 x r contingency table, 
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Sol. From Ex. 16-28. 
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EXERCISES 
1. Find the value of y? for 2 x 2 contingency table : 


Hair colour ~—> 


Eye colour | 


[Ans. 12-6] 


2. Ina locality 100 persons were randomly selected and asked about their educational _ 


achievements. The results are given as below : 


Education 
Middle High School College 
Male 10 15 25 
Sex 
Female 25 10 15 


Can you say that education depends on sex ? 
: [Ans. 9-9, Education depends on sex] 
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3. From the following table find 


ae 


a 


Fair 
Boys 592 

Sex 
Girls $44 


In an experiment with immuni 
were obtained : 


Inoculated 

Not Inoculated 

Examine the effects of vaccine 
[Ans. Vaccine is ef 

In an experiment on the immun 

obtained. Derive your inferenc 


Inoculated 


Not Inoculated 


[Ans. 
In experiments on the Spahling 
obtained : 
Died 
a 
Inoculated 
Not inoculated or 
inoculated with 
control media 
Total 


Find the value of yw? and test t 


. The table below gives the data « 


Inoculated 
Not inoculated 
Test the effectiveness of inocule 


- Can vaccination be regarded as z 


following data ? 
of 1,482 persons exposed to smz 
1,482 persons, 343 were vaccin 


. From the following data test whe 


economic conditions : 


E> 
Good 
Economic conditions 
Not Good 
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[Ans. 12-6] 


and asked about their educational . 


sh School College 
15 25 
10 15 


1s. 9:9, Education depends on sex] 
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3. From the following table find whether the hair colour and sex are associated : 
Hair Colour 


Fair Red - Medium Dark Jet black 
Boys 592 119 849 504 36 
Sex 
Girls 544 97 677 451 14 
[Ans. Associated] 


4. In an experiment with immunization of cattle from tuberculosis, the following results 
were obtained : 


Affected Unaffected 
Inoculated 12 26 
Not Inoculated 16 6 


Examine the effects of vaccine in controlling the susceptibility to tuberculosis. 
[Ans. Vaccine is effective in controlling the susceptibility to tuberculosis] 
5. Inan experiment on the immunization of goats from anthrox the following results were 
obtained. Derive your inference on the efficiency of the vaccine : 


Died Survived 
Inoculated 2 10 
Not Inoculated 6 6 


[Ans. Survival is not associated with inoculation of vaccine] 
6. In experiments on the Spahlinger anti-tuberculosis vaccine the following results were 


obtained : 
Died or seriously Unaffected or Total 
affected not seriously affected 
Inoculated 6 13 19 
Not inoculated or 
inoculated with 8 3 Il 
control media 
Total 14 16 30 
Find the value of yw and test the independence. 
[Ans. 4-7] 
7. The table below gives the data obtained during an epidemic of cholera : 
Attacked _ Not attacked 
Inoculated 31 469 
Not inoculated 185 1,315 


Test the effectiveness of inoculation in preventing the attack of cholera. 
[Ans. Inoculated is effective] 
8. Can vaccination be regarded as a preventive measure for small pox as evidenced by the 
following data ? 
of 1,482 persons exposed to small-pox ina locality, 368 in all were attacked. Of these, 
1,482 persons, 343 were vaccinated and of these only 35 were attacked. 
9. From the following data test whether there is any association between intelligency and 
economic conditions : 


Intelligency 
Excellent Good Medium Dull 
Good 48 200 150 80 
Economic conditions 
Not Good 52 180 190 100 
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-10. A producer of a certain film claimed that his movie was not liked equally by men and 
women. Accordingly, a sample of men and women was collected. The following are 
the number of men and women falling into each of the five classes : 


Most liked More liked Liked Not much Disliked 
liked 
Men 110 591 840 500 30 
Women 90 549 670 450 20 


Is producer’s remark supported by data ? 
11. The following table shows the association among 1000 school boys, their general ability 
and their mathematical ability. Calculate the co-efficient of contingency between the 


two. 
General ability 
Good Fair Poor 
24 Good 44 22 4 
Maths ability Fair 265 257 178 
Poor 41 91 98 
12. The following data observed for hybrids of Datura : 
; Flowers 
; Violet White 
Prickly - 47 21 
Fruits 
Smooth 12 3 


Apply yw? -test-to-test the association between colour of flowers and character of fruits. 
Given that 


y =p JP = 0-402 for y* =0-7 
P =0-399 for y? =0-71 
[Ans. There is no association] 


13. From the following table, test the hypothesis that the flower colour is independent of 
flatness of leaf :. 


Flat leaves Curled leaves Total 
White flowers 99 36 135 
Red flowers - 20 > 25 
Use the following table giving the values of w? for 1 d.f. for different values of P. 
| oe 0:5 0-1 0-05 
yes 0-455 2-706 3-841 


[Ans. wy? =0:5] 
14. Candidates for a degree in Mathematics are required to pass a subsidiary examination 
in Physics. The table below gives the number of candidates classified according to the 


grading awarded in two subjects. Test if the performances in the two subjects are 
independent. 


Class in Maths 
I II Pass Fail 
I 38 60 50 11 
Class in Physics II 24 70 100 27 
"Pass 12 36 91 25 
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15. Sixteen pieces of photogray 
from nearly white to a ver 
sheet and pasted on cards, 
scraps from the several she: 
pack. Twenty observers the 
each tint either ‘light’, ‘mec 
The tollowing table shows t 


2 
ze L 
2 Light 

& 2 Medium 

E = Dark 
Zz Total 2 


Show that there is a significar 
the name assigned to the oth: 
16. The following table gives thi 
nature of work. Test whether t 


Skilled 
Male 40 
Female 10 
Total 50 
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[Ans. There is no association] 


it the flower colour is independent of 
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15. Sixteen pieces of photographic paper were printed down to different depths of colour 


16. 


from nearly white to a very deep blackish brown. Small scraps were cut from each 
sheet and pasted on cards, two scraps on each card one above the other, combining 
scraps from the several sheets in all possible ways, so that there were 256 cards in the 
pack. Twenty observers then went through the pack independently, each one naming 
each tint either ‘light’, ‘medium’ or ‘dark’. 

The tollowing table shows the name assigned to each of the two pieces of paper : 


= Name assigned to upper tint 

oe Light Medium Dark Total 

2» Light 850 571 580 2001 

© 2 Medium 618 593 455 1666 
22 Dark 540 456 457 1453 

Zz, Total 2008 1620 1492 5120 


Show that there is a significant association between the name assigned to one piece and 
the name assigned to the other. 

The following table gives the classification of 100 workers according to sex and the 
nature of work. Test whether the nature of work is independent of the sex of the worker : 


Skilled Unskilled Total 
Male 40 20 60 
Female 10 30 40 
Total 50 50 : 100 
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t, F and Z Distributions and Small 
Sample Tests 


17.1. Introduction 


Let x}, x2, ..... x, be the members of a random sample drawn from a normal population 
with mean yu and s.d. o. 


= 12 

Let x= bs x; 
nz 
i=] 
1 nh 

and rs -> (x;- x) 

n iz) 

The joint distribution of x), x2, .... X, is 
1 


: 
1 < 2 

P = Sra D (x - Ax), dX2, .... AX, 

(ae) on at & | eee " 


{(x;-%)+(%-p)}? 


-Ms= 
tt 
M= 


Now (x;- p)? 


ul 
~ 
~ 

i] 
_ 


= ¥ {(x; -¥)° +(¥-p)? +2(@-n)(x;-3)} 


i=l 
= ns? +n(%—p)* 
Represent the sample values (x), X, ...... Xn) by a pt. P with Co-ordinates (x), x2, .... Xp) 


in Enclidean hyperspace of 1 dimension. Let O be the origin. Let OL be the line through O 
with direction ratios (1, 1, .... 1). Draw PM L OL. 


P(X,,X5-..X,) 


[@) M L 
Let co-ordinates of M be (0, ©, .... c1)-where (1 # 0) 
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2 Y (y-¥)? +nF—n)? f ¥ (i-¥) = | 
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Than d.r.’s of OM are a, ©, ..... 

and d.r.’s of PM are x, ~ a, x; 
Since PM 1 OM, 

Q(x) — &) + A(t. - a) +. 


Co-ordinates of M are (x 


PM = ( 

X 

= &£ 

[: 

PM = ,y 

and OM? = x7 4....4%2= Nn. 


If x ands are kept fixed, P mov: 
surface of a hypersphere of radius P 

’. The spherical shell in which j 
of length d(OM). 

’, As x increases by dx and s 


(PM)"* . d(PM) . d(OM) 


= {s 


= cc 
aP = cc 
= ¢ 
where c, and ¢ are constants. 
=> sand X are independent. 
Dist. of x is 
dP = €9 
X varies from — to ce, 
c2 is given by 
eo 
C2 f 
-o 
X- 
Put P 
o/ Vi 
io.0) 
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Than d.r.’s of OM are a, G, ....., & 
and d.r.’s of PM are x; — 0, x9» - O, ..... Xy— Oh. 
Since PM | OM, 
Q(x, — O&) + A(X. — OL) +... + (x, — &) = 0 


Xp +... Xp 


is and Small = era 
ts | .. Co-ordinates of M are (X, X,..... x). 


PM = (x,—¥)? +..... +(%,—-3)" 


x (=x) = ns? 


i=] 


Lee ne ee 


yle drawn from a normal population e PM = Vans 
and OM? = x7 +4....4%2= nx? => OM = Vnix. 


If x ands are kept fixed, P moves in (n— 1) dimensional space orthogonal to OL on the 
surface of a hypersphere of radius PM and centre M. 

.. The spherical shell in which P moves has thickness d(PM) and suffers a displacement 
of length d(OM). 

’, As x increases by dx and s by ds, P describes an element of volume proportional to 


(PM)"* . d(PM) . d(OM) 
ard Gr ph yt = {s vay Vn ds.Jn dé 


5 = constant s”~2 dsdx 
1)} ‘ 
dP = const. exp. 4— —> {ns +n(x—1)?} $5"? dsdx 
20? 
-p)? +2(% —n)(x; -¥)} 


1 ns? I (e-p)* 
. = yy ee ieee 
-py? ef Eo-n=d = 4qe 7% "ds ce? Ce ie 
i=] 
where c, and c are constants. 


i -ordi 1) X25 00 X, ee : 
ot. P with Co-ordinates (x1, x2 n) cose and @ ade ancenendent 


origin. Let OL be the line through O 


Dist. of x is 
1 (@-p)* 
ay) 
) dP = c2e 2 ov /n ax 
n 
x varies from — © to oe, 
. €2 is given by 
_1@-ny? 
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a alee 
or ave Je? dy=1 
-n 
= oe Vin =I 
ay ee 
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Dist. of x is 


roy 
> zis w (2) 
: n 


Dist. of s is 


2 = 
dP = ce 26% gh-2 dy 
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—— is an unbiased esti 
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17.2. Student’s t-Distribution 
Student’s ¢ statistic is defined by 


ae | 
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17.2. Student’s t-Distribution 
Student’s ¢ statistic is defined by 
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, where v=n—1 

RY 

= 2 

z | |e 
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o 
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Dist. of tis 


1 
)2! 
1 v i? 
dP = ——, Td ,0<P<co 


= 1 dt 
= Mowe 
v Bl ~, = 2a 
a. +5) 
v 


This distribution is known as student’s ¢-distribution with v d.f. 

Remark. f-distribution was first found by W.S. Gosset in 1908 in his paper entitled. 
‘The probable error of the mean’ written under the name of his student. Student defined 
his statistic as 

x-p 
RY 
and investigated its sampling distribution. Later on in 1926, Prof. R.A. Fisher defined his own 
statistic and gave a rigorous proof for its sampling distribution. He defined his statistic as 


t= 


n 
where & is a M(0, 1), w’ is a chi-square variate with n.d.f. and E, w’ are independent. 
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Distribution of Fisher’s t 


Since & and Ww are independ 
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The joint distribution 
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Marginal distribution 
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Distribution of Fisher’s t 
Since € and y are independent, their joint distribution is 
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which is chi-square variate with ( — 1) d.f. Fisher’s statistic ¢ takes the torm 
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= (FE) : . 


Bao oar: : | 
which is student’s ¢ statistic. Thus, student’s ¢ can be regarded as a particular case of 
Fisher’s t. 


17.2-1. Properties of t-distribution 
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6 
Y»=—-3 = —— 0asv> ow 
v—-4 
M3 
Also, Bi = —}=0,7,= yb; =0 
1) 
Recurrence formula for moments 
(2r~-1).....Q) > 
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(2r—-3).....1 a, 
= 
nae aaa om) eee tS 
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17.2-2. Chief Features of the t-Probability Curve 
The equation of the t-probability curve is 


ye 1 1 
1 vtl 
Wolz.5| ee 
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(1) Since on changing f to — ¢, y does not change, curve is symmetrical about t = 0 
Median = 0 

(2) y 0as |t| 3 &, 

.. Curve is asymptotic to t-axis at both ends 

(3) y decrease rapidly as |¢| increases. 

*. y is maximum for t = 0 

*. Mode =0 
Mean, Mode and Median coincide. 


17.2-3, Limiting form of t-distribution 
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which is the density f” of a standard normal variate. 
t-dist. becomes normal when v is large. 
m4 Example. /f x is t-distributed with k degrees of freedom, show that 
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has a beta distribution. 
17.3. t-tests 
Tests of significance based on ¢-distribution are called ¢-tests. Various t-tests are : 
(i) Test for single proportion. 
(i) Test for the difference of means. 
(tii) Test for the significance of an observed sample correlation co-efficient. 
(iv) Test for the significance of an observed regression co-efficient. 
(v) Test for the significance of a rank correlation co-efficient. 

All these tests are for small samples and are based on fundamental assumption that the 
parent population is normal. 
Rules of Decision 

. Let P 


Pit > to} 
; 2P {t> to} 

For various fixed values of P and for v ranging from 1 to 60; values of ¢ have been 
tabulated in the form of t-tables. For v > 60, t is considered as a standard normal variate. The 
value fo is called the critical value of ¢ at level of significance P and d.f.v. 


ty by 


To test the significance the calculated value of ¢ is compared with tabulated value at 
certain specified level of significance. Generally 5% or 1% level are taken. 

If calculated value of |t| exceeds tabulated value, the null hypothesis is rejected and the 
difference is said to be significant and if it is less than tabulated value, the hypothesis is 
accepted at the level of significance adopted. 

Remark 

In above rules both the ends of ¢ curve are considered and hence tests with these rules 
are called two-tailed tests. If, however, one tail is used tests are called single-tailed tests. 

Since t-curve is symmetrical about t = 0 

PXt2 to(Q)} = Pits—t)(a)} 
where is the level to significance and fy (&) the critical value of at level of significance a. 
= Pklt|> to (a)} = 2P{t> to (a)} 
a 


=> on Pit 2 (@} 
changing ato 2 a 
=> a =, P{t 2 t)(2a)} 


Hence for a single tailed test, the critical values of ¢ for level of significance a can be 
obtained from those of two tailed test by looking the values at level of significance 2a. 


17.3.1. Test for Single Mean 
Let x), X2, .... x, be a random sample from a normal population with mean uu. The 
problem here is to test “is the sample mean differs significantly from the population mean 


4” ? Assuming the null hypothesis. “There is no significant difference between the sample 
mean and the population mean”, the statistic. 
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which follows student’s ¢-distriby 


Ex. 17-1. 4 mechanist is m 
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is meeting the specification. 

Sol. Here ¥ =0:742,n= 


t = 

No. of df =10-1=9 
From tables fo.9s for 9 df = 

beal > 10-05 

x differs from u signific 
Ex. 17-2. Ten individuals ¢ 
heights are found to be inches 63 
data discuss the suggestion that. 
fort=1:8 P=0-% 
fort=1.9 P=0-9 
where P is the area to the left of 
Sol. Calculation of mean a: 


Assuming population mean | 

t= 

when ¢ increases by 0-1, P inerea 
when ¢ increases by 0-09 P incre: 


For ¢ = 1-89, P= 
Pr=2(1-P)=2(1 -| 
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f= 
Sidn 
where x = sample mean 
2 1 . =)\2 n 2 rae 
and So ay > (4;-%) ats (where s° is sample s.d.) 
n~ n- 


which follows student’s ¢-distribution with (7 — 1) d.f., is calculated. 


Ex. 17-1. 4 mechanist is making engine parts with axle diameters 0-700”. A random 
sample of 10 parts shows a mean diameter of 0-742” with as.d. of 0-04”. Test whether work 
is meeting the specification. 


Sol. Here ¥ = 0-742, n= 10, s =0-04 and up = 0-7”. 


t= Tan = (4) n-1 ~3-15, 
No. of df =10-1=9 
From tables fo.os for 9 df = 2:26 
beal > to.05 
X differs from 1 significantly and hence the work is not meeting the specification. 
Ex. 17-2. Ten individuals are chosen at random from a normal population and the 
heights are found to be inches 63, 63, 66, 67, 68, 69, 70, 70, 71 and 71. In the light of these 
data discuss the suggestion that the mean height in the universe is 66” having given that 
fort=1-8 P=0-947 
fort=1-9 P= ee eee 
where P is the area to the left of the ordinate at t. 
Sol. Calculation of mean and s.d. 
Total 
x: 63 63 66 67 68 69 70 70 7] 71 
X=x-68:-5 ~-5 ~2 -1 0 1 2 2% 33 3 -2 


xX: 25 25 4 1 0 1 4 4 9 82 
= y) 82 ai 
X =68+(-2) ~ 67-8, 5? = 35 _(- =8-2—0-04 =8-16: 
10 10 \.10 or 


Assuming population mean to be 66, p = 66 


X—p) p—_ (1-8)V9 5-4 
= -l= = =]. 
: ( s ¥8-16 eee 


when f increases by 0-1, P inereases by 0-008 


when ¢ increases by 0-09 P increases by Gra (0-09) = 0-0072 


For ¢= 1-89, P = 0-9542 
Pr=2(1 — P) = 2(1 — 0-9542) = 0-0916 > 0-05 


Difference is not significant at 5% level of significance and hence test provides 
no evidence against the population mean being 66” 


Ex. 17-3. Nine patients, to whom a certain drug was administrated, registered the 


Jollowing increments in blood pressure : 


7,3,-1,4,-3,5,6,-4, 1 
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Show that the data do not indicate that the drug was responsible for these increments. 
The values of t for 10, 9 and 8 df. at 5% level of significance are 2:23, 2:26 and 2:31 
respectively. 

Sol. Let x be the variable for the increment in blood pressure. 


Total 
xe 3 -1 4 -3 5 6 -4 1 18 
a a i? ne ae oe ne: rs | 
(x- x: 25 1 9 4 25 9 16 36 1 126 
xX = = =2, = = (126) = 15-75. 
Assuming that the drug was not responsible for the increments in blood pressure, | = 0 
2/9 
Vise 
No. of df. =9-~1=8 
to.os = 2:31 
teal < to.05- 


The data do not indicate that the drug was responsible for increment in blood 
pressure. 


Ex. 17-4, Ten patients to whom a drug administered registered the following additional 
hours of sleep : 
0-7, — 1-1, — 0:2, 1:2, 0-1, 3-4, 3-7, 0-8, 1-8, 2-0 
Compute the statistic you would use to determine whether these data justify the claim 
that the drug does produce additional sleep. 


Sol. ; Calculation of mean and s.d. 
x 0-7 -1-1 — 0-2 1:2 0-1 3-4 
x-X: -— 0:54 ~ 2:34 — 1-44 — 0-04 ~ 1-14 2-16 
(x-x): 0:2916 5-4756 2:0736 0-0016 1:2996 4-6656 
3-7 0:8 1-8 2-0 12-4 
2-46 — 0-44 0-56 0-76 
6:0516 0-1936 0-3136 0-5776 20-9440 
pe Shower Sop0a 
| 10 os 
Assuming that drug does not produce additional sleep, u = 0 
SION 
~ f2-0944 ~ 


Ex. 17-5. A certain stimulus administered to each of the 12 patients resulted in the 
following increases of blcod pressure : 
5, 2, 8,— 1, 3, 0, 6,-—2, 1,5, 0,4 
Can it be concluded that stimulus will be, in general accompanied by an increase in 
blood pressure ? Given that to.95 (10) = 2:23, to.95 (11) = 2-20, tfo.05 (12) = 2-18 where ty (n) 
denotes the value of ‘t’ for ‘n’ df. at ‘a’ level of significance. 
Sol. Calculation of mean and s.d. 
Total 
xX: 5 2. 8 -1 3 0 6 -2 1 5 0 4 31 
x: 25 4 6 1 9 0 36 4 1 2 0 16 185 
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all 


salts » = 8S _ (MY 8 
Tg oe, 44a 


Assuming that stimulus will not be accompanied by an increase in blood pressure, 
p=0. 
31 Vil _ 31vil 
12 1259 1259 ~ 


144 
No. ofdf. = 12-1=11 
Now to.os (11) = 2-20 
teal > 10.05 


‘. Assumption is wrong and hence the stimulus will be, in general, accompanied by 
an increase in blood pressure. 
Ex. 17-6. Show that the 95% fiducial limits for the mean \ of the population are 


xt Sto.s Deduce that for a random sample of 16 values with mean 41:5” and the sum of 


the squares of the deviations from the mean 135 (inches) and drawn from a normal 
population, 95% fiducial limits for the mean of the population are 39-9” and 43-1”. 
Sol. Since P{|t| < to.os} = 0:95, 95%, fiducial limits for the mean 1 are given by 


xX- 
I¢| = Pn S to.05 
S 
Sto.05 
, a < 
= |¥~pl ia 
= _ Sto.os ~ . Stoos 
be, ¥--08 << X+ 
in. er vn 
H = 16, % =41:5, S= a. 
“ ies: 16~1 


No. of d.f. = 16-1=15 
: to.os = 2-13 
95% fiducial limits are 


3 
41-5 ae (2:13) ie. 39-9 and 43-1. 


EXERCISES 
1. A machine which produces mica insulating washers of use in electric devices is set to 
turn out washers having a thickness of 10 mils (1 mil = 0-001 inch). A sample of 10 
washers has an average thickness of 9-52 mils with a s.d. of 0-60 mil. Test the 
significance of such a deviation. ‘Ans. t= 2-4] 
. Find the “student’s ?’ for the following variate values in a sample of eight 
-4,-2-2, 0, 2, 2, 3,3 
taking the mean of the universe to be zero. 
. Find student’s ¢ for the following variate values in a sample of ten : 
—6,-4, -3,-2,-2,0, 1, 1,3, 5 
taking 1 to be zero. [Ans. 0-7] 
. Ten individuals are chosen at random from a population, their heights are found to be 
63, 63, 66, 67, 68, 69, 70, 70, 71 and 71 inches respectively. Test whether.the mean 


. [Ans. 0:3] 


a 


= 


2 


10. 


11. 


12. 


13. 


14. 
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height is 69-6” in the population, given that for 9 d.f. P{|t| = 2-262} = 0-05. 
(Ans. || = 1-89] 


. The nine items of a sample had the following values : 


45, 47, 50, 52, 48, 47, 49, 53, 51 
Does the mean of the nine items differ significantly from the assumed population 
mean of 47:5 ? Given that 
P=0-945 for t=1-8 ma og 
y ee for 1=1-9 eee eel 
The gain (in bushels per acre) in yield due to the use of a variety of wheat in nine plots 
is as follows : 
16:3, 13-4, 3-8, 7:9, 2:6, 2:5, 9-6, 7-2 and 3:3 
Are the observations consistent with the hypothesis that the average gain is 7:5 bushels 
per acre ? [Ans. Yes] 
Ten individuals are chosen at random from a population and their heights are found to 
be inches 63, 63, 64, 65, 66, 69, 69, 70, 70 and 71. Discuss the suggestion that the 
mean height in the universe is 65 inches. (Given that for 9 df. to.95 = 2-262) 
[Ans. ¢= 2-02] 
The table signifies additional hours of sleep gained by 10 patients in an Seponneent 
with a sleeping drug : 
Patient 1 2 3 4 5 6 7 8 9 10 
Hours gained 0:7 -1-1 -0-2 -12 01. 34 3:7 08 19 20 
Assuming that the hours of sleep is a normally distributed variable, calculate ‘f° for 
the above table. [Ans. 1-9] 
A certain drug caused the following increases in blood pressure of 12 patients. 


3, 0, 6,—2, 1, 5, 2, 8,0,-1, 1,5 


Can it be concluded that the stimulus does not effect blood pressure? [Ans. t= 2-6] ee 


The mean weekly sale of the ice cream bar was 146-3 bars. After an advertising 
campaign the mean weekly sale in 22 shops for a typical week increased to 153-7 and 
showed a standard deviation 17-2. Is this evidence that the advertising was successful ? 
(Given that for d.f. = 21, fo.95 = 2-08] {Ans. ¢= 1-97] 
A certain colliery is supposed to supply coal of ash content about 15. To test this 20 
random samples of the colliery’s coal are selected and tested. The null hypothesis is 
that the ash content is in fact 15. The results of 20 tests gave an average ash content of 
16-8 with a standard deviation of 3-6. Is this sufficient evidence for rejecting the 
hypothesis ? 
(Given that for 19 d.f. fo.05 = 2-09) [Ans. ¢= 2-18] 
A sample of 11 rats from a central population had an average blood viscosity of 3-92 
with a s.d. of 0-61. On the basis of this sample establish 95% confidence limits of 1, 
the mean blood viscosity of central population. [Ans. 3-51 and 4: 33] 
The average breaking strength of steel rods is specified to be 17-5 Ibs. To test this a 
sample of 14 rods was tested and gave the following results (in unit of 1,000 Ibs) : 
15, 18, 16, 21, 19, 21, 17, 17, 15, 17, 20, 19,:17, 18 

Is the result of the experiment significant? Also obtain the 95% fiducial limits from 
the sample for the average breaking strength of steel rods. {Ans. t= 0-68] 
The mean of I.Q’s of 10 boys is 97-2 with the sum of the squares of the deviation from 
the mean of 1833-6. Do these data support the assumption of a population mean I.Q’s 
of 100 ? Find the 95% confidence limits for the population mean. 

[Ans. ¢= 0-62, 107-41 and 86-99] 
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15. 


16. 


17 


A sample of 20 items has mean 42 units and standard deviation 5 units. Test the 
assumption that it is a random sample from a normal population with mean 45 units, 
Also obtain 95% fiducial limits. 

(Given that fo.95 = 2:09 for 19 d.f.) [Ans. t= 2-6, 39-6 and 44-4] 
A drug was administered to 10 patients and the increments in their B.P. were recorded 
to be 


o, 3,-—2, 4, —3, 4, 6,0, 0,2 
Is it reasonable to believe that the drug has no effect on B.P. ? 
The weights of 15 bags of sugar taken from the filling machine are given below (in 
kgs) 
16-1, 15-8, 15-9, 16-1, 16-2, 16-0, 15-9, 16-0, 15-7, 15-7, 15-8, 16-0, 16-0, 15-8, 15+7 
If the machine is supposed to be giving 16 k.g. of sugar per bag on an average, test 
whether the data suggest that the machine is failing in its purpose. 


17.3-2. Test for the difference of means - 


Let x1, 2, ... i, and yj, yo, .. 


» Yny-be two independent random samples with means X, y 


and standard deviations s,, s2 respectively from two normal populations with the same. 
variance ©. The problem is to test the hypothesis that the population means are j1;, and Lo. 
Assuming the population means 1; and fy, the statistic is defined as 


x~Y ~ (4) -H2) 


t = 
S Dg 
nN n2 
] . a2 |X =e) 
= xj-X) + i- 
where s ete +d o-39) 
= ——— [nsf +n 53] 
ny +nz -2 
Now x ~ N(u1,0/ fn) 
and 


¥~N(Mz,0/ {ng ) 


7 t 1 
: Mmi-tavo [E+ 2) 
ny ny 
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bad 
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[a 1 
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m Ny 
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2 
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2 
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where v = ny +n) — 2, is a y’ variate with v = (nm, + m2 —2) af. 


2 


(¥ -F) — (wy — Hp) vs? t 
2 


. v 1 . 
is Bz (:. ;| variate. 
Statistic ¢ follows ¢-distribution with v = n, +n.-2 df. 
If the hypothesis to be tested is “Are the two population means same or the two sample 


means differ significantly”, under the null hypothesis “population means are same i.e., 11) = 
Hy or the two sample means do not differ significantly” the statistic to be calculated becomes 


which also follows ¢-distribution with (, + nz - 2) df. 

(ii) If (i) m = m, = n(say) and (ii) the samples are not independent but the sample 
observations are paired together i.e., the pair of observations (x;, y;) (i= 1, 2, ... n) correspond 
to the same (ith) sample unit. The problem here again is to test “Are the sample means differ 
significantly”. 


Under the null hypothesis “sample means do not differ significantly” the statistic 


eae 
** sin 
where dj = xj-y; 


1 n 
d = —) d;and 
n= 
i=] 
l< =2 
8 = — PD (d,-d) 
nl jz 
which follows distribution with (n — 1) df, is calculated. 
Ex. 17-7. For a random sample of 10 pigs fed on diet A the increases in weight in 
pounds in a certain period were 
10, 6, 16, 17, 13, 12, 8, 14, 15,. 9 Ibs. 
For another sample of 12 pigs, fed on diet B the increases in the same period were 
7, 13, 22, 15, 12, 14, 18, 8, 21, 23, 10, 17 Ibs. 
Test whether diets A and B differ significantly as regard the effect on increases in weight 


(or test whether the mean increases in the two samples are significantly different). You may 
use the fact 5% value of t for 20 df. is 2-09. 


Sol. Calculation of mean and s.d. 
B dee 10 6 16 17 13 12 8 14 15 9 Total 
=x-13: -3 +7 3 4 0 -1 -5 1 2 -4 -10 
xX: 9 49 9 16 0 1 25 1 4 16 130 


ys 7 13. 22 15 12 14 18 8 2 23 10 17 
Y=y-14: -7 -t 8 1 -2 0 4 -6 7 9 -4 3 W2 
Y:. 49 1 6 1 4 #0 6 36 49 81 16 9 326 


e 
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_ 1 
~ 10412-2 


Assume the diets A and B don 
weight i.e., the mean increases in t 


eS {ana 


Now | = 

No. of d.f. = 
Now foos for 20d,f. = 
‘ teal < 


Assumption may be correct. 


Ex. 17-8. The following data 
divisions of equal areas of two agri 
the same as Plot II except for the a 


Plot]: 6:2 5-7 6:5 

Plot IT: 5-6 5-9 5-6 
Is there a significant difference 
between their means as a criterion 


x: 62 5:7 65 
X=x-¥: 02 -03 055 
X*: 0:04 0-09 0-25 
y: 56 59 5-6 


Y=y-y: -0-1 02 -01 
Y’: 0:01 0-04 0-01 
_ xx 
x= ra 6 and y 
= : (0-:64+0 

10+10-2 
im 0:3 
0-44 f/1 1 
—— — + — 
9 YV10 10 
No. of df = 
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tion means same or the two sample 
opulation means are same i.é., 4) = 
ie statistic to be calculated becomes 


re not independent but the sample 
ons (x;, »;) (i = 1, 2, ... 2) correspond 
0 test “Are the sample means differ 


iffer significantly” the statistic 


‘ted. 
n diet A the increases in weight in 


5,. 9 Ibs. 
creases in the same period were 


3, 10, 17 lbs. 
ard the effect on increases in weight 
are significantly different). You may 


nd s.d. 
8 14 15 9 Total 
~5 1 9° sae 210" 
25 1 4 16 130 


8 21 23 10 17 
—6 7 9 -4 3 
36 498i 16 9 


= 
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and 


> 2 
8) =) for x= py for x -(F) =13-1=12 


326 


314 


5° = U2 for y= [2 for Y=-——-1 = —— 


12 


10+12-2 


12 


igs oo oh es {aoa2)+02) SX em =21-7 


Assume the diets A and B do not differ significantly as regard the effect, on increases in 
weight i.e., the mean increases in two samples are not significantly different. 


12~15 
Now \t| = 
No. of d.f. = io 12-2=20 
Now toos for20d.f. = 2-09 
: tal < to.05 


Assaniption may be correct. 
Ex. 17-8. The following data represent the yield in bushels of Indian corn on ten sub- 


Plott: 
Plot IT: 


37120 


er fo 


divisions of equal areas of two agricultural plots, in which Plot I was a central plot treated 
the same as Plot II except for the amount of phosphorus applied as a fertilizer : 


6-2 5:7 6°5 6-0 6:3 
5-6 5-9 5-6 5-7 5:8 


58 57 60 60 58 
5-7 6-0 5-5 5-7 555 


Is there a significant difference between the yields on the two plots, using the difference 


Y=y 


between their means as a criterion of judgment ? 


x: 62 57 65 60 63 
02-03 05 0 
XxX: 0:04 0:09 0-25 
y: 56 59 56 57 58 


-y: -01 02 -0-1 0 O1 
Y: 0-01 0-04 0-01 0 0-01 
_ Xx a ST 
as = 6 and ame ies 
4 1 0-44 
2 — —_____(0.64+0-24) = —— 
. 1031022' ) 9 
t= ee: eer er 

0-44 f1 1 

9 Y¥10 10 

No. of df = 10 +10-2=18 


0 0:09 0:04 0-09 0 - 
57 60 S55 57 55 57 


Total 

58 57 60 60 58 60 
03-02-03 0 0-02 

0 0:04 0-64 


0 03 -—0:2 
0 0-09 0-04 


0 -0-2 
0 0:04 0:24 
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Now foos for 18 df. = 2-10 
: ; teal > to-.05 
The difference between the yields of the two plots is significant. 
Ex. 17-9. Two independent samples of 8 and 7 items respectively had the following 
values : 
Sample I: 9 11 13 11 15 9 12 14 
Sample IT: 10 12 10 14 9 8 10 
Is the difference between the means of the samples significant ? 
Given that if P{\t| > to} = 0-05, to = 2:16 for 13 df. and t) =2-13 for 15 df, 


Sol. Calculation of mean and s.d. 
Total 
x: 9 11 13 11 15 9 12 14 
X=x-11: © -2 2 0 res 1 3 6 
Xx’: 4 0 4 0 16 4 1 9 38 
y: 10 12 10 14 9 8 10 
Y=y-10: 0 2 0 yn; (are 0 3 
YY: 0 4 0 16 1 4 0 25 
ae ends d ieee eee 
ea WY Wie | 
_ (8). 
ae le AG 
? 2 23_(3)' 166 
sas eS ha) 40 
8 = gts f(2)+(S)}- 
8+7-2 | \16 49 (14) (13) 
47 73 


37 
}) = ee = TS 1.2 
[801 fi , 2  ¥(801)(5) | 
14:13 V8 7 


No. of df = 8+7-2=13 
to.o5 (13) = 2:16 
teal -< to.os 
Difference is not significant. 


Ex. 17-10. Two horses A and B were tested according to the time (in seconds) to run a 
particular track with the following result : 


Horse A: 28 30 32 33 33 29 34 
Horse B: 29 30 30 24 27 29 


Test whether you can discriminate between the two horses. You can use the fact that 5% 
value of t for 11 df. is 2-20. 


Sol. Let x and y be the time variable for horses A and B respectively. 


te Oa a 


< 
! 
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ee 


Cale 
x: 28 
X=x-32: -4 
x’; 16 
y: 29 
Y=y-29: 0 
Y’: 0 

xX = 32 

sy = H2 

sy = U2 

Sat 

74 

lt] = F 

No. of df. = 7+ 

toos = 2-2 

teal > to.0: 


Difference is significant an 
Ex. 17-11. The following table sh 


obtainable by four slightly different m 
respectively. 


Methods A 
4PM. 29-75 
8 PM. 39-20 
Are there significantly more bacte 
Sol. Calcul 
Method 4PM. & 
(x) 
A 29-75 
B 27-50 d 
C 30-25 
D 27-80 4 
Total 
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slots is significant. 
ms respectively had the following 


15 9 12 14 
9 8 10 
significant ? 
and to = 2:13 for 15 df. 
id s.d. 
Total 
15 9 12 14 
4 ~2 1: 3 6 
16 4 1 9 «38 
9 8 10 
1 -2 0 3 
1 4 0 25 
_ me 
y=10+7= 7 


166)| 801 
(*)h ~ (14)(13) 


37 
ee Sy 9 
¥(801)(15) ; 


‘ing to the time (in seconds) to runa 


33 29 34 
27 29 
horses. You can use the fact that 5% 


and B respectively. 
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—_ _—_-_—»—»-—~—— a OE 


Calculation of mean and s.d. 


Total 
x 28 30 32 33 33 29 34 
X=x-32 -4 2 0 1 lL «=3 Pas 
x 16 4 0 1 1 9 4 35 
y 29 30 30 24 27 29 
Y=y-29 0 1 t. 25> a9 0 —5 
yY 0 1 1 25 4 0 31 
7 5 219 _ 5 169 
= -—- =e =2?9 —-— = — 
: af 7? 6 6 
ee --(S) -s- 8-20 
se eae aad OE 49 49 
ia te -2 (4) 18 
go = oh [He Me 27 
1+6=2 | 7 6 462 
219 _ 169 
7 6 131V11 


f= SS = = 4 
Ml [2447 fa : 1 y(13)(2447) 
462 V7 6 


No. ofdf = 7+6-2=11 
to.os = 2-20 
Teal > 10.05 
Difference is significant and hence two horses can be discriminated. 
Ex. 17-11. The following table shows the mean number of bacterial colonies per plate 


obtainable by four slightly different methods from soil samples taken at 4 P.M. and 8 PM. 


respectively. ; 
Methods A B C D 
4PM. 29-75 27-50 30-25 27:80 
8 PM. 39-20 40-60 36-20 42:40 
Are there significantly more bacteria at 8 PM. than at 4 PM.? 
Sol. Calculation of mean and s.d. of the difference 
Method 4PM. 8 P.M. 
(x) () d=y-x d-d (d-d)* 

A 29-75 39-20 9-45 — 1-325 1-756 

B 27-50 40-60 13-10 — 2-325 5-406 

C 30-25 36-20 5:95 ~ 4-825 23-281 

D 27-80 42-40 1460 3-825 14-631 


Total 43-10 45-074 
ee 
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= 43-10 45-074 
d = —=10-775, S= 
4 3 
_ (10-775) 3a) = 21:55V3 
1 45-074 V45-074 
= 5-56 
No. ofdf. = 4-1=3 
Now to.os (3) = 3-18 < toa 


.. Difference is highly significant and hence there are significantly more bacteria at 
8 P.M. than at 4 PM. 

Ex. 17-12. From the data given below test whether there is a significant difference 
between the effects of two drugs, on the assumption that different random samples of patients 
were used to test the two drugs A and B. 

Additional hours of sleep gained by use of soporific drugs. 


Patient : 1 2 3 4 5 6 7 8 9 10 
Drug A: 0-7 -16 ~02 -1:2 -0O1 34 3-7 08 0 2-0 
Drug B: 1-9 0:8 11 0-1 -0-1 4-4 5-5 16 46 3-6 
Sol. Calculation of mean and s.d. of the difference 
Patient x y d=y-x (d- d) (d- dy 
1 0:7 1:9 1:2 — 04 0:16 
2 — 1-6 0-8 2-4 0-8 0-64 
3 ~ 0-2 1-1 1:3 - 0:3 0:09 
4 ~ 1:2 0-1 “1:3 -03 0:09 
5 - 0-1 -0-1 0 ~ 1-6 2:56 
6 3-4 44 1-0 - 0-6 0-36 
7 3-7 5:5 1-8 0-2 0-04 
8 0:8 1:6 0:8 — 0:8 0-64 
9 0 4-6 4-6 3-0 9:00 
10 2-0 3-6 1-6 0  —__—000 
Total 16-0 13-58 
= 16 13-58 
d = —=}- 2 = 
10 Bo 8 9 
1-6 4-8V10 
f= - 10 = = 4/12 
13-58 ¥13-58 
9 


No. of dff = 10-1=9 
fo.05(9) = 2:26 < tea 
Difference is highly significant. 


Ex. 17-13. Jn a certain experiment to compare two types of pig-foods A and B, the 
following result of increase in weights were observed in pigs : 
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Pig No. : 
Increase in FoodA : 
wt. in lbs. FoodB : 


(a) Assuming that two sam; 
better than food A ? (b) Examin 
the foods. 

Sol. (a) In this case the tw: 
of means test for unpaired data | 

Let x and y be the increase : 


FoodA 


WN OWN & WD 


; 


pes 
* 
I 
tad 
— 
N 
i 


8 = 


No. of df. = 
foos = 
At 5% level, difference 

is better. 
() In this case the two samp! 
means test for paired data will be 
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45-074 
3 


|-55¥3 


45-074 


2 are significantly more bacteria at 


+ there is a significant difference 
fferent random samples of patients 


drugs. 
6 7 8 9 10 
34 3:7 08 0 2-0 
44 55 16 46 36 


id s.d. of the difference 


Sipe Veen ile ee 
-x  (d-d)  (d-dy’ 
~0-4 0-16 
0-8 0-64 
203 0-09 
~03 0-09 
a6 2:56 
~0-6 0-36 
0-2 0-04 
~ 0:8 0-64 
3-0 9-00 
0 0-00 
13-58 
58 
_ 
8V10 _ 41 
13-58 


wo types of pig-foods A and B, the 
n pigs : 
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Pig No. : 1 2 3 4 5 6 7 8 
Increase in FoodA : 49 53 51 52 47 50 52 53 
w.t. in lbs. FoodB : 52. 55 52 53 50 54 54 53 


(a) Assuming that two samples of pigs are independent can we conclude that food B is 
better than food A ? (b) Examine the case when the same set of eight pigs were used in both 
the foods. 

Sol. (a) In this case the two samples of pigs are independent and hence the difference 
of means test for unpaired data can be applied. 

Let x and y be the increase in weights due to foods A and B respectively. 


eaeRrR RR OOO 


1 
= EX?-—(2x) a eas 
ny 8 8 
So7ee ao 
8 8 
247 135 
gp = 8 8 382 
8+8-2 112 
407 423 


No. ofd ff = 8+8-2=14 
toos = 2:14 < teal 
At 5% level, difference between sample means is significant. Since y >x, food B 


is better. 


(b) In this case the two samples cannot be regarded as independent. So the difference of 
means test for paired data will be applied. 
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12 
S = q 
2V8 
= —== ~4:32 
it 7B 3 
q 


No. ofdfi = 8-1=7 
lo.05 = 2°36 and 10.0) = 3-50 
Itcall > fo.05 and f.0) 
Difference between means is significant and hence food B is better. 
EXERCISES 


- To compare the price of a certain commodity in two towns, ten shops were selected at 
random in each town. The following figures give the prices found : 


Town A: 61 60 56 63 56 63 59 56 44 61 
Town B: 55 54 47 59 551s 575A GSB 
Test whether the average price can be said to be the same in the two towns. 

[Ans. Yes] 


. Eight pots growing three wheat plants each were exposed to a high-tension discharge 
while nine similar pots were enclosed in an earthenware cage. The number of tillers 
in each pot were as follows : 

Caged : 17. 26 18 25 27 28 «26 ©«©23~0©«©17 
Electrified : 16 16 22 16 21 #18 #«15- «20 

Disciss whether electrification exercises any real effect on tillering. Given that 
toos (15) = 2-131. [Ans |f] = 2:75] 
- Ina test given to two groups of students the marks obtained were as follows : 

First group : 18 20 36 50 49 36 34 41 

Second group: 29, 28 26 35 30 44 46 
Calculate student’s ¢ and state the relevant null hypothesis. [Ans. |¢| = 0-28] 
- The heights of six randomly chosen sailors are, in inches : 63, 65, 68, 69, 71 and 72. 
Those of ten randomly chosen soldiers are 61, 62, 65, 66, 69, 69, 70, 71, 72 and 73. 
Discuss the light that these data throw on the suggestion that soldiers are on the average 
taller than sailors; given that 


_ P = 0-539 for t=0°1 
* EN) P = 0-527 for t=0-08 
- The means of two random samples of sizes 9 and 7 respectively are 196-42 and 198-82 


respectively. The sum of the squares of the deviations from the means are 26-94 and 
18-73 respectively. Can the samples be considered to have been drawn from the same 


[Ans. |¢| = 0-099] 
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caesar eat seta 


10 


11. 


12, 


normal population, it bein; 
Two independent samples 
Sample 1 : 9 1 
Sample 2 : 10 OL 
Is the difference between t 


y= 


. Two types of batteries 4 a 


results are obtained : 
No. in sample 
A 10 
B 10 
Is there a significant differ: 


. Intelligence test of two gro 


if the difference of the mez 
Group of 12 girls : n 
Group of 8 boys : n 


. A farmer. grows crops on tw 


acre and on B Rs. 20 worth 
on the two fields are : 

Year 
Yield A Rs. per acre 
Yield B Rs. per acre 
Other things beings equal, | 
to continue the more expen 
Calculate the value of ‘¢’ in 
frequencies are : 
A: 16 10 
B: 8 4 
The yields of two types ‘T 
replications are given belo. 
the mean yields ? You may a 
1-476. 


Replication 

Yields in lbs; 2 
(Type 17) 

Yield in Ibs : 2 
(Type 51) 


The following figures show 
each was administered two 
two) 

Patient ; 1 2 
SoporificA: 1:2 18 - 
SoporificB: 1:5 16 
Apply an appropriate test to 
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a 


Total 
50 $2 53 
54 54 53 
—4 -2 = -—16 
16 4 O= 44 


ice food B is better. . 


owns, ten shops were selected at 
: prices found : 
3 59 S56 44 61 
S10 65570 54058 
same in the two towns. 

[Ans. Yes] 
osed to a high-tension discharge 
vare cage. The number of tillers 


28 26 4 230=«17 
18 15. 20 

effect on tillering. Given that 

[Ans. |¢| = 2-75] 

ibtained were as follows : 
49 36 34 41 
30 44 «446 
thesis. [Ans. || = 0-28] 
ches : 63, 65, 68, 69, 71 and 72. 
5, 66, 69, 69, 70, 71, 72 and 73. 
on that soldiers are on the average 


0-1 
3:08 
spectively are 196-42 and 198-82 


[Ans. |¢| = 0-099] 


ns from the means are 26-94 and- 


» have been drawn from the same 
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normal population, it being given that fo.95 (14) = 2-145 ? [Ans, 2-6, No] 
Two independent samples of 8 and 7 items respectively had the following values : 


Sample 1 : 9 11 130011 15 9 12 14 
Sample 2 : 10 12 10 14 #9 8 10 
Is the difference between the means of the samples significant ? Given that 
_ P = 0-874 for t=1-2 ‘an Graaao 
"|p = 0-892 for t=1-3 lied) 


. Two types of batteries A and B are tested for their length of life and the following 


results are obtained : 


No. in sample Mean Variance 
A 10 500 hours 100 
B 10 566 hours 121 


Is there a significant difference in two means ? [Ans. Yes] 


. Intelligence test of two groups of boys and girls gave the following results. Examine 


if the difference of the means is significant. 
Group of 12 girls : mean = 84, 
Group of 8 boys : mean = 81, 


s.d. = 10 


s.d. =12 [Ans. No] 


. A farmer grows crops on two fields A and B. On A he puts Rs. 10 worth of manure per 


10. 


11. 


12. 


acre and on B Rs. 20 worth. The net returns per acre, exclusive of the cost of manure, 
on the two fields are : 


Year 1 2 3 4 5 
Yield A Rs. per acre 347 28 42 37 44 
Yield B Rs. per acre 36 33 48 38 50 


Other things beings equal, discuss the question whether it is likely to pay the farmer 
to continue the more expensive dressing ? Given that fo.95 (4) = 2-78 [Ans. t= 3-8] 
Calculate the value of ‘t’ in case of two characteristics A and B whose corresponding 
frequencies are : 

A: 16 10 8 9 9 «8 

B: 8 4 § 9 12 4 [Ans. || = 1-66] 
The yields of two types ‘Type 17’ and ‘Type 51’ of grains in pounds per acre in a 
replications are given below. What comment would you make on the differences in 


the mean yields ? You may assume that if there be 5 d.f. and P= 0-2, tis approximately 
1-476. 


Replication : 1 2 3 4 5 6 
Yields in lbs: 20°50 24-60 23-06 29-98 30-37 23-83 
(Type 17) 

Yield in lbs ; 24:86 26-39 28-19 30-75 29:97 22-04 
(Type 51) [Ans. ¢= 1-49] 


The following figures show the additional hours of sleep gained by 10 patients when 
each was administered two soporifics A and B (with an adequate period between the 
two) 
Patient : 1 2 3 4 5 6 7 8 9 10 
SoporificA: 1:2 1:8 -0:3 -0-7 01 3-1 2:2--15 00 2-1 
SoporificB: 15 16 04 00-06 25 45 19 2:2 3.0 
Apply an appropriate test to see whether the soporifics really differ in average effect. 
[Ans. ¢= 2:1] 
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13. In each of 10 pairs of rats, one receives protein from raw peanut while the other 


14. 


15. 


16 


receives it from roasted peanuts. 


Pair : 1 2 3 4 5 6 7 8 9 10 
Raw >: 61 60 56 63 56 63 59 56 44 61 
Roasted : 55 54 47 69 S51 61 #57 54 62 = 58 


Test whether or not roasting the peanuts had any effect on their protein value. 
[Ans. Roasting had no effect on protein value] 


In a rat-feeding experiment, the following results were obtained : 

Diets : Gain in wts in gms 

High Protein: 13 14 10 11 #12 #16 «#10 #9 #11 =#«12:°9 12 
Low Protein: 7 WW 10 8.10 13 9 


’ Investigate if there is any evidence of superiority of one diet over the other. 


17. 


18, 


19. 


[Ans. |¢| = 1-93] 
Mitchel concluded a paired feeding experiment with pigs on the relative value of 
limestone and bonemeal for bone development. The results are given below : 


Pair : 1 2 3 4 5 6 7 8 
Lime stone : 49:2 53:3 506 520 46.8 50-5 52:1 53-0 
Bonemeal : 51:5 549 52:2 53:3 5160 541 54-2 54:3 


Determine the significance of the difference between the means in two ways (1) by 
assuming that the values are paired (2) by assuming that the values are not paired. 
[Ans. Difference is significant] 

Ten soldiers visit a rifle range for two consecutive weeks. For the first week their 
scores are : 

67, 24, 57, 55, 63, 54, 56, 68, 33, 43 
and during the second week they score, in the same order : 

70, 38, 58, 58, 56, 67, 68, 72, 42, 38 
Examine if there is any significant difference in their performance. [Ans. |¢| = 2-04] 


Eleven school boys were given a test in mathematics. They were given a month’s 
further tuition and a second test was held at the end of it. Do the marks give evidence 
that the students benefited by extra coaching ? 
Roll No. > 1 2 3 4 5 6 7 8 9 10 11 
Marks First test : 23 20 19 21 18 20 18 17 23 16 19 
Secondtest : 24 19 22 18 20 22 20 20 23 20 17 
[Ans. || = 1-48] 
To test the desirability of a certain modification in typists desks, 9 typists were given 
two tests of as nearly as possible the same nature, one on the desk in use and the other 


on the new type. The following differences in the number of words typed per minute 
were recorded : 


Typist : A B C D E F G H I 
Increased no. of 
words per min: 2 3 0 3 -1 4 -3 2 5 


Do the data indicate that the modification in desk promoted speed in typing ? 

[Ans. t= 1-96] 
Drugs A and B administered to 10 different persons each indicated the following rises 
in B.P. 
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A ; 3 ( 
B : -1 -3 
Stating the assumptions n 
(i) Drug A has no effect « 
(zi) Drugs A and B are eq 
20. Use t-test to examine the : 


Sample siz 
Type l 8 
Type ll 7 


17.3-3. Test for the significance 


Consider random sample (x, 
Let the correlation co-efficient 
correlation co-efficient. The hy 
‘Whether population correl: 
Assuming this hypothesis, t 


follows t-distribution with (n - 2 


To test the significance, calc 
the significance is tested as usua 


Ex. 17-14. Test whether the 


Sol. t= 


No. of df = 
fo.os = 
Correlation is signific 


Ex. 17-15. 4 random sampl 
correlation co-efficient of 0-3. Is 


Sol. t= 


No. of df = 

fo.os = 

: Correlation is not sign 

Ex. 17-16. Find the least u 
bivariate normal population sign 


Sol. No. of df, = 


a boos = 
The least value of ‘7’ signifi 


rJ18—2 : 
| yl-r? 


*The proof of this is beyond the scope o 
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romoted speed in typing ? 

[Ans. t= 1-96] 
each indicated the following rises 
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A : 3 Oo -1 2 1 2 -1 0 1 3 
B > -1 ae) Oo 1 1 0 —2 0 3 1 


Stating the assumptions necessary, test the following null hypothesis : 
(4) Drug A has no effect on B.P. 
(ii) Drugs A and B are equally efficacious in respect of B.P. 
20. Use t-test to examine the superiority of type I electric bulbs in the given data : 


Sample size Sample mean Sample variance 
Type I 8 1234 1296 
Type Il 7 1036 1600 [Ans. t= 9-3] 


17.3-3. Test for the significance of an Observed Correlation Co-efficient 


Consider random sample (x1, 91), (%2, 2), -..-- (4m Yn) froma bivariate normal population. 
Let the correlation co-efficient calculated from the sample be r and p the population 
correlation co-efficient. The hypothesis to be tested is : 

‘Whether population correlation co-efficient is zero i.e., p = 0’. 

Assuming this hypothesis, the statistic 


rjn-2 


yl -r? 
follows t-distribution with (n - 2) af-* 


To test the significance, calculated value of ‘?’ is compared with the tabulated value and 
the significance is tested as usual. 


Ex. 17-14. Test whether the correlation is significant if r = 0-6, n = 18. 


_ (0-6)VI16 | 
= ore = 
No. of df. 18-2=16 
toos = 2:12 < teat 
Correlation is significant. 


Ex. 17-15. A random sample of 18 pairs from a bivariate normal population showed a 
correlation co-efficient of 0-3. Is this value significant of correlation in the population ? 


(0-3)/18-2 
Sol. t= 1- 0.09 = 1-26 
No. of df: 18-2=16 
toos = 2:12 > tea 
Correlation is not significant. 


Ex. 17-16. Find the least value of ‘r’, in a sample of 18 pairs of observations jroma 
bivariate normal population significant at 5% level. 


Sol. No. ofdf, = 18-2=16 
2s toos = 2°12 
The least value of ‘r’ significant at 5% level is given by 


Sol. 


18-2 
Rey > 2-12 
-r 


*The proof of this is beyond the scope of this book. 
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ie, 16r > (2:12 (1-r) = 4-4944(1 — 7) 
ie., 20-4944 > 4.4944 
ie, r > 0-2193 
ie., Ir} > 0-4683 


Regd. value of |r| = 0-4683. 


Ex. 17-17. A random sample of 15 from a normal population gives a correlation co- 
efficient of ~ 0-5. Is this significant of the existence of correlation in the population ? 


(-0-5)V13 
Sol. t=  J0-75 = — 2-08 
foos for 13 diff = 2°16 > [tcall 
Sample correlation co-efficient is not significant. 
17.3-4, Test for the significance of an observed Regression Co-efficient 


Consider a random sample (x), 1) ..... (%w Yn) from a bivariate normal population. Let 
the equation of line of regression of y on x (obtained from the sample) be 


y-Y = B(x-x) 
where b = regression co-eff. of y on x. 
Let Y, = y+b(x;-%). 


The hypothesis to be tested is : “The regression co-efficient of y on x in the population 
is B”. Assuming this hypothesis, the statistic 


7 1/2 
(n-2) )) (x; -%)? 
t= (b- B) et 
X oi-%)? 
i=] 
follows ¢-distribution with (n — 2) d.f. 


17.3-5. Test for the Significance of a Rank Correlation Co-efficient 

Let p be the rank correlation co-efficient obtained from a sample of size n. The hypothesis 
to be tested is : 

“population rank correlation coefficient is zero.” 

Assuming this hypothesis, the statistic 


1/2 
fonts n-2 
‘lip? 


follows t-distribution with (n — 2) d.f. 
Ex. 17-18. /2 pictures submitted in a competition were ranked by two judges with 
results as shown in the table below: 


Pictures : A B C D E F G H IT J K L 
Rank assigned by 

Ist judge : 5 9 6 7 1 3 4 12 2 11 10 8 
Rank assigned by 

2nd judge : 5 8 9 11 3 1 2 10 4 12 7 «6 


Calculate p the rank correlation co-efficient. Is there a lack of independence in 
these ranking ? (Assume that on the hypothesis of independence of two sets of n readings 
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1/2 
n-2 
t=p fq follows t-dis 


Sol. Let d be the differenc: 
Now Xd’? =0+1+9+ 16 


Pp 


No. of df : 
fo.05 
Ranking is significan 


p—_ 
° 


Is a correlation co-efficien 
pairs of values from a nor 
Fine the least value of r, - 
significant at 5% level. (G 
Determine the range withii 
in random sampling from 
(Given that P{|t (8)| > 2-31 
4. Determine the least value 
random sampling from a t 
(i) the sample size is 5, 
(ii) the sample size is mo1 


Lg 


o 


17.4. F-distribution 


Let x1, X2, ... Xn, and yy, y2, 
be two independent random sany 
2 
Oo. 


Let x and y be the sample 1 


a) 
nN 
Ti 


The statistic F is defined by 
F= 


Let Vy) = 


Vy F 
v2 
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4-4944(1 —7°) 


ulation gives a correlation co- 
lation in the population ? 


ion Co-efficient 
yivariate normal population. Let 
ithe sample) be 


icient of y on x in the population 


1/2 
x) 


we 
9) 


| Co-efficient 
asample of size n. The hypothesis 


were ranked by two judges with 


GH I J KL 


2 10 4 12 7 6 


‘here a lack of independence in 
ndence of two sets of n readings 
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ee 


1/2 
n—-2 
t=p 23 oF ) follows t-distribution with (n — 2) df. Given that for 10 df, to.os = 2-23. 


Sol. Let d be the difference in ranks assigned to the same individuals. 
Now Xa’ =04+1+4+94+164+44+44+4+4+4+1+9+4=60 


_ ,_ ((60) _ 
eet (12)(143) ~ om 
__ vi0 _ (0-79) Vi0 
= (0-79 
a asm n i= -(0- -79)? ¥0-3759 
= 4-075 
No. ofdf. = 12-2=10 


toos = 2:23 < tea 
Ranking is significant and hence there is lack of independence. 


EXERCISES 


1. Isa correlation co-efficient of 0:5 significant, if obtained from a random sample of 12 
pairs of values from a normal population ? [Ans. No.] 
2. Fine the least value of r, in a sample of 25 pairs from normal population, which is 
significant at 5% level. (Given that for 23 d. J. to.os = 2-07). [Ans. 0-4] 
3. Determine the range within which r will not be significant at 5% level of significance 
in random sampling from a bivariate normal population when the sample size is 10. 
(Given that P{|t (8)| > 2-306} = 0-05) 
4. Determine the least value of |r| that will be significant at 5% level of significance in 
random sampling from a bivariate normal population when 
(i) the sample size is 5, 
(ii) the sample size is more than 30. 
(Given that P{|t(3)| > 3-182} = 0-05) 
17.4. F-distribution 
Let x1, X2, ... Xn, and yi, 2, w+ Yay 
re two independent random samples drawn from the same normal population with variance 
Oo. 
Let x and y be the sample means and 


S : casi 
The statistic F is defined by 
2 
Ee (s? >s?) 
2 


Let y= m-l, w=n-)1 


wF _ ( (m-)S? 
v2 (n> -1) S$ 


= (n, 81/0") / (nz 87/0") 
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Now 1; s,’/o7 is a y" variate with v, df, and np s,2/o? is a w’ variate with v, df. 


WF, yO 
— isa B, (2, “2 vaniate. = 
vo 2 2 


Distribution of F is 
Ad bees 
(2)? 
is 1 V2 vy F ~ 
apP = a(t “2 vy +2 & 
ay i nz}? Mean = | 
v2 
My Put r=; 
1 vp? ov? F2 ) 
“E3) one dF 0< F<. 2) = | 
22D POT ME). 
This distribution is called the distribution of the variance ratio F with y, and v2 df. a x 
Ex. Let y,’ and wy" be two independent chi-square variates with n, and nz df, Find 
brah. A wi lm _ 
the distribution of F = Pay aa lo = 
Y2/n2 
17.4-1. Constants of F-Distribution oe 
u’-(0) = E(F’) 
a ee} ba 
“ 2) 5 bs alr 
B 2°9 (v. +, F) 2 Mode. The density function of 
4% start 
= v? vy i Fé dD v 
(22) * enn? 4G 
27.2 (v2+y,F) 2 
dx 
Put WF= wx => dF= “2 
k yy ; YL 
2 
r 1 oo aise 2 = lal 
v2 x Vv} 
= dx af 
(2) e(Zt yi (2+r}+(2-r) 2 
49 (1+x) 2 
. ML 
= (| ! p+. %2—r) df _ ve 
Mp wy \2 2 dF vy 
22 Bi 3° 
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isa w’ variate with v2 df. 


dF 0<F<© 


vy + V2 
Fy 2 
nce ratio F with v, and v2 df. 
variates with n, and n, df. Find 
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vy (v7 -2)....(v -2r) 
Mean = pj (0)=—2— >1. 
Vy -2 
Put r= 2 
, v. vy, (vy, +2 
yw’, (0) = (2 1 (4) +2) 
wy) (v2 ~2)(v2 -4) 
v3 (vy, +2) 


M2 = pr (0)— {11 (0)}? 


~ v3 (v, +2) ' ( V> i 
w(v2~2)(v2-4) (v2 -2 


__2v3 (vy, + -2) 
v1 (v2 — 2)? (v2 -4) 


Mode. The density function of F variate is 


M2 ME 
Zz 1 v2 ve F2 
a ae e( Zt 2) yp +Vv2 
2) Ope Fy? 

‘ “1 _» vy +¥2 

vy 2 on ae 

MO YZ ra F (v2 +; F) 
ve uae uGas we 


(v2 +4 Fy" rt 


“I 2 MtV2 4 


2 
Es sian! (v2 +Vy F) 2 


dF Vy “2 
(7 2 


(v2 +¥, FY"! +¥2 


pees eer eS My 
vj tv 
dF p(7t 2) -F? Gee ene 
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df ~ 
dF 0=> 
F = 0, and 
v vp+¥ 
(H-t)orenm-F (2) vy, =0 
Vo (1, -2 
= ye ED 
vy (v2 +2) 
for F=0, f(F)=0 which is minimum value of f(F). 
for Fe= wii?) f(F) is maximum 
vy (v2 +2)’ 
-2 
ei 
vy (v2 +2) 


17.4.2. Chief features of F-Probability Cure 
The equation of the F-probability curve is 


V2 V2: id ie 
vy v? F2 
ye) ee 
p(2. 72 (vo +, F) 2 
(i) At F=0, y= 0 
and as F430, y 0 


..  F-axis is asymptote to the curve at positive extremity. 
(2) Mode is at the point 

v2 (¥ -2) 

Vy (v2 + 2) 

which exists only when v; >2 ° (.. F20) 

v2 (4-2) | 

Vy (12 + 2) 


F= 


Now Mode-1 = 


a. EI 2 
Vy (v2 + 2) 
=> Mode < 1. 
(3) Karl Pearson’s co-efficient of skewness is 
mean ~ mode 
s.d. 

Since mean > 1 and mode <1. 

F probability curve is highly positively skewed. 

(4) The pts of inflextion of F curve exist when v, > 4 and are equidistant from mode 
(See 17.4.3). 

(5) y increases steadily at first until it reaches is maximum value and then decreases * 
slowly. 
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17.4.3. Point of Inflextion 


The shape of the probat 


y 


The equation of the probabilit 


Put WF = 


Then y 


where c 


and 


log y 
Differentiating w.r.t. x 
1 dy 
y dx 


Differentiating again 


At points of inflextion 


ay 
dF? 
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(F). 


imum 


. 0< F<, 


ctremity. 


> 4 and are equidistant from mode 


naximum value and then decreases 
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: 
The shape of the probability curve is approximately as shown. 


y 


17.4.3. Point of Inflextion 
The equation of the probability curve is 


Ly 2 Me. 
vy? v92 F2 
™ a “2 “~¥2 
2° 2) (+H F) 2 
Put VWF = vW2Xx. 
1=1 
x 
Then = Cc. 
y (l+x)'*™ 
1 vy; v. 
where grate eat eM ye 


log y = loge +(/- 1) log x ~ (1+ m) log (1 + x) 
Differentiating w.r.t. x 
1 d 1 
kin ae (22059) ates 
ya x 1+x 


Differentiating again 


4(2) 4% ie ot) 


y? (de y dx? x? (1+x)? 
At points of inflextion 
2 
diy 2 (wi) a 
dF? \¥) dx? 
3 
d’y vy} dy 
and —z =I—|] —> #0 
dF? (2 a 
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Points of inflextion are given by 


(8) _ (-t) , ltm: 

y? dx 7 x? * Gaxy? 

eee ae ee 

ie., {o-v3- a ere ail (ex =0 

ie, {(1- 1) (1 +x) —(1+ mx}? -(J- 1) (1 +x)? + (I+ m) xe =0 

ie., (I-12 —(1—1)} (1 +x)? + {(L + mt (14+ m)} 2 - 21-1) (1+ m) x (1 +x) =0 
ie, 2 4-1 -(1-1) + (1+ m+ (1+ m)-2(1- 1) (1+ m} 


+ 2x (1-1) -(I-1)-(/- 1) (I+ m)} + {(- 1° - 7-1} =0 
The roots of this equation give two points of inflextion. Let these be x, and x, 


~2{(1-1)* -(I-1)-(/-1)(1+m)} 


+ = 
m2 Ay? —(1-1) + (4m)? +(14 m)-2(1-1)(1+m) 
7 2(1-1)(m +2) 
—(1-1)(m+2)+(+m)(m+2) 
_ 2¢-)) 
m+} 
2(v, -2 
ae 2(4,-2) 
v2 +2 
Let F, and F2 be the corresponding values of F. Then F; and F; are pts. of inflextion. 
We have VWF = wx, => F = mes 
1 
and F, = ae x2 
vi 


¥2 
By iy = = OG x) 
al 


_ 2 2%=2) 
Vy V2 +2 
= 2 (mode). 


F, — mode = mode — F). 
F, and F, are equidistant from mode 
3 


The condition <- #0 > vjp>4* 


Points of infflextion exist if v, > 4. 


Ex. 17-19. (a) [fx has a F-distribution with (m, n) df. show that ~ — has a F-distribution 
with (n, m) df. 
(b) Deduce that, for any k>0 


Pesky +Pqy sch 


*The proof of this is left as an exercise for the reader. 
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where x and y are F-distributed \ 
Sol. (a) Dist. of x is 


dP = - 

B 

Put x= us => dx ba 

y y 
Dist. of y is 

dP = - 

B 


1 
Bs is F distribute: 


=> y= 
(b) Since tota: prob. is unity, 
P{x: 
| Now P{x2k} = P 
| 
= P 
l am 
where y= 7 is F-distribute: 


Pirskh +P{ys7} 


Ex. 17-20. If v, = v2, the me 
O10; = 1 
where Q), Q3 are quartiles. 
Sol. Let vy; =v. =v. 
Let x be F distributed with (v 


1 
Then y = . also is F-distr 


P{xsa} = 
() Let k be the median. 
Then P{x<k} = P. 
= P 
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where x and y are F-distributed with (m, n) and (n, m) dfs. respectively. 
Sol. (a) Dist. of x is 


ne I< 
m? n2 x 
dP = PSE min &, OSX < 
(5-5) corm) 2 
- 1 1 
rt m)x 5 Put x= — => dx =-3= 
)} 2-2-1) (I+ m) x (1 +2) =0 y y 
_1)(1+m)} Dist. of y is : 
1-1 -(- 1} =0 woth (1)? 
ion. Let these be x, and ~: 2 2 
tion. Le 1 2 dP = m*n y = iy 
-d-+m) (2.2) my2 ” 
“4(1+m)-2(I-1)(l+m) ! ae [n+ | 
(m+2) m2 n2 Pi 


dy Os y<o 


1 
= y = — is F distributed with (n, m) df 


(b) Since totai prob. is unity, 
Pix sk} +P{x2k}=1 


rel 


1 
where y= = is F-distributed with (n, m) df. 


hen F, and F; are pts. of inflextion. 


ty 


iH 
% 
ren 
fe 
lA 
xf 
—— 


Now P{x 2k} 


Pixsky +P{y sche. 


Ex. 17-20. If v; = v2, the median of F distribution is at F = 1. Show also that 
0:03 = 1 
where Q, Q3 are quartiles. 
Sol. Let v, =v. =v. 
Let x be F distributed with (vy, v) df. 


1 
Then y = 2 also is F-distributed with (v, v) df 


1 2 tips Me 
1) df, show that — has a F-distribution 
x 


— Pix<a} = P{ysa}, for any a. 
(i) Let k be the median. 
Then Pixsk} = P{x2k} 


1 
| = Ply<— 
| » it 
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which is possible only when k = 1. 
.. Median =1. | 
(ii) Now P{xsQ,} = P{x2Q3} 


1 
re 
[53 

= P|x<— 
Q; 


which is possible only when 


Q) = ion 
Leé., O103 = 1 


EXERCISES 


2, show that 


#2. 
2 
v2 


2. Ifx has a F distribution with (1, n2) df, show that 


-1 
[1m =| 
n2 


1. If Vy 


tt 


P(F 2 Fo) 


has a Beta distribution. 


3. If vy = v2 =n-—1, show that 
Ne eee 
n- 


4. Given that 
P{F (10, 12) > 2-753} =0-05 


and P{F(1, 12) > 4-747} = 0-05 
find 

P 4 F(12,10) > 

(i) ir ) sat 


(ii) P {-V4-747 < 4(12) < 4-747}. 


17.4-4. Relation between t and F distributions 
Student’s ¢-distribution is 


dP = —=——~ 


where v = No. of df 


} 
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2 


Put tf =) 
dt = - 
dP = 4 


which => x isa F variate wi 
Remark. All tests of signi 

distribution. 

17.4-5. Relation between F an 
F-distribution with v;, v2 d. 


Hl 


dP 


Let A=" 


al 


’ 
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where v= No. of df. 
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Put ‘ 


t x => t= Vx 
d i 
hd ade 
2vx 
_! 
1 1 2 
dP = 2. —_ 
Vv fz ;| wal 
op (i+% , 
V 
ey | 
1 1 x io peed 
Ww oft 2) eee oS 
Pla 3 (1+ 2)? 
Vv 
Ls 1 
v2 x2 


which = x isa variate with df. 1 and v. 


Remark. All tests of significance basesd on ¢-distribution can be done by using F- 
distribution. 


17.4-5. Relation between F and y,? 
F-distribution with vj), v2 df. is 


v4 Vp) _ ~ 
eves gp te 
dP = “fu. te) vo Vy As dF 
B 2 ) (v2 + 4 F 2 
r vy, + V2 Mb Mbs 
2 1 1 ve F% 
r vo ML r 4] vj t+v2 dP 
2) 2ST) fine) 2 
ye 
(2%) 
2 1 
Let MoS a a 
Ge 
2 Vy 
vyytv2 1 
-{2 2-1} ViEWD: a 2 a) 
Viel? JV 9 ae 
-f2-1} , ek ai 
Ine |? [e-1 Be ie 


a 
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yy ytv2-1 
e 2 (vy, + v2 -2) 2 1 
pal 21 M1. 
22 (v2-2)2 2 vy 
"I "I y+v2~-1 
ee ac Oe es a 
e 2.22 {ret | 
7 mod 
2)2 2 
j|—-—— 
V2 
vy 7-2 
v,-1 V2 2 


yy 
UR. Mi Soe, 2h 
>e 2.22 5.92 
e 
as ¥2 — co 
hae 
yt? 22 nai 
VF | uy vy, F)\2 
1) v2 vy 
VWF 


mar d ee as ¥7 > oo 
As v) > ©, probability differential is of the form 


Tie 3B. Mo oe 
dP = 22? F2 e 2 dF 


ia 


—= 


NF vy 


+.) 
So ge: Ee” <divi ek) 
2712) 
2 
> v, F isay> — variate with vy, af. 


17.5. F-tests 


Tests of significance based of F-distribution are called F tests. Various F-tests are : 
(i) For equality of population variance. 
(ii) For the significance of an observed multiple correlation co-efficient. 
(vii) For the significance of an observed sample correlation ratio. 
(iv) For testing the linearity of regression. 
All these tests are for small samples. 
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Rules of Decision 
Let P=P{F> Fy (v1, V2)} 
For.a given value of P and f 
of F-tables. 
The value Fo is called the cr 
To test the significance the 
certain specified level of signific 
If Fea > Frap, the null hypott 
and if Feat < Fiap, the hypothesis 
Ex. 17-21. When v, = 2, s 
significant probability p is 


le 


F= 


where v; and v2 have their usual 
Sol. The dist. of F is 


dP = ~ 
p 
Let Fo be the significance le: 


Then p=P 
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2, 
yy-2Z 
1 V2 2 


F’ tests. Various F-tests are : 


2lation co-efficient. 
ation ratio. 
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Rules of Decision 

Let P= P{F > Fo (4, v2)} 

For a given value of P and for v, v7 dfs, values of Fy have been tabulated in the form 
of F-tables. 

The value F is called the critical value of F for V1, ¥2 dfs. at level of significance P. 

To test the significance the calculated value of F is compared with tabulated value at 
certain specified level of significance. Generally 5% or 1% levels are taken. 

If Fea > Frab, the null hypothesis is rejected and the difference is said to be significant 
and if Fa) < Fiap, the hypothesis is accepted at the level of significance adopted. 

Ex. 17-21. When v, = 2, show that the significance level of F corresponding to a 
significant probability p is 


x2 
v2 v 
ee ae 
is 2 


where v, and vy have their usual meanings. 
Sol. The dist. of F is 


2 
2v52 dF 
VY 241 
p(t 2 (v2 +2F) 2 


Let Fo be the significance level of F corresponding to the probability p. 


dP = 


foe} 
Then p = P{F2Fy\= | aP 
fy 
% 
= 2v,? i aF 
= . S 
a(t “2. / (v +2F) 2 
vy roo) 
_ 2h Joa 1 
~ v2 v2 2 
a(t 2 | (v2 +2F) 2 Fo 
v2 
. 2v7 1 
v2 v2: 
Be) On +2%)2 
” r( 241] 
= 2v7 Se ge, Vs 
v9 v2. 
ror?) Gy +a%)? 
3) 2 1(2} 
ee ae 1 Taye] 
= 2, : > (2 T()=1) 
fe “2 (v. +2F) 2 
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v2 
— 2 
ve — 
= => Fo = 2 P =n 
(v2 +2Fp) ? 


17.5.1. Test of significance for equality of population variance 
Consider two independent random samples x), X2, ..... Xn, and 1, V2, -..-- Yn from normal 
populations. The hypothesis to be tested is: ‘The population variances are same’. 
Assuming this hypothesis, the statistic 


SP (2 2 
Bey (s; > 33) 
2 
ny 
where : Se by (x; -X) 
ny -1 i=l 
1 2 = 
and S? = (yj -¥), 
ny~1 jz 


follows F-distribution with v, =n; — 1 and v. = n2— 1 degrees of freedom. 

By comparing the calculated value of F with the tabulated value for v, and v, df. at 
certain level of significance (5% or 1%) the significance is tested. 

Ex. 17-22. /fis known that the mean diameters of rivets produced by two firms A and 
B are practically the same but the standard deviations may differ. For 22 rivets produced by 
A the s.d. is 2:9 m, while for 16 rivets manufactured by B the s.d. is 3-8. Test whether the 
products of A have the same variability as those of B. 

Sol. n, =22, no = 16, s;=2:9 and s,=3-8 


mst _ (22) (2-97 


2 => . 
eae oo oe 
ee 
ar 52 = LOC ~ 15.49 
15 
ae Sf 15-4 1-748 
~oSP 881 


Nos. of degrees of freedom are 16 — 1 = 15 and 22-1 =21 
: Fo.os = 2:18 > Feat 
Variability for two types of products may be same. 

Ex. 17-2. «::ve2 below are the qualities of ten items (in proper units) produced by two 
processes A and B. Test whether the variability of quality may be taken to be the same for 
the two processes. 

Process A: 3 7 5 6 5 4 4 5 3 3 

Process B: 8 = 5 7 8 3 2 7 6 > 7, 

(F-value for n, =9 and n2 = 9 degrees of freedom is 3-18 at 5% level of significance and 
5-35 at 1% level of significance). 

Sol. 

Process A :x > 3 7 5. 6 5 4 Total 
X=(x-x) > - -155 2:5 0-5 1-5 0-5 — 0-5 
X-> 2:25 625 025° 2:25 025 0-25 
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Process B: y—-> 8 
Y=(y-j) 3 22 
Yu5 4-84 


xX = 


pe 


Degrees of freedom'are 10 
; Fo.05 
Variability of quality 
Ex. 17-24. Two random sar 
are characterized as follows : 
Population from 
which the sample 
is drawn 
I 
I 
You are to decide if the twe 
Sol. Let x and y be the obs 


Now X(x-x) 


and Z(y-) 


Nos. of df are 8-1=7 a1 
i Fo.0s 
Variances of two pot 


1. Two independent sample: 
the variable (weight in ot 
Sample I : 9 
Sample I: 10 
Test whether the estimate: 
Fo.o5 for 7 and 6 df is 4+: 
[Ans. Not significant] 


MATHEMATICAL STATISTICS 


2 


zliige 8 oat |) 


iriance 
1 aNd V1, V2» +++ ng from normal 
n variances are same’. 


‘ees of freedom. 

lated value for v; and v2 df. at 
tested. 

2ts produced by two firms A and 
differ. For 22 rivets produced by 
the s.d. is 3-8. Test whether the 


‘81 


ne. 
in proper units) produced by two 
may be taken to be the same for 


4 4 5 3 3 
2 7 6 5 7 
8 at 5% level of significance and 


6 5 4 Total 
1-5 0:5 — 0:5 
25 0:25 0:25 
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OO OE 
4 5 3 3 = 45 


— 0-5 0-5 —155 —155 
0-25 0-25 2-25 225 =16°5 


Process B: y—> 8 5 7 8 3 2 
Y=(y-y) > 2:2 -0:8 1:2 2:2 —2:8 — 3-8 
ys 4-84 0-64 1-44 4-84 7:84 14.44 

7 6 5 7 =58 


1-2 0-2 —0:8 1:2 
1-44 0-04 0-64 144 =37-6 


x = 45, 735-8 Gea O> and s2 a2) © 
9 9 
po ele S538 
16-5 — 


Degrees of freedom are 10- 1=9 and 10—1=9. 
ae Foos = 3:18 > Feat 
Variability of quality may be taken to be the same for two processes. 
Ex. 17-24. Two random samples of sizes 8 and 11, drawn from two normal populations, 


are characterized as follows : 


and 


1. 


Population from Size of Sum of ' Sum of 
which the sample the sample observations squares of 
is drawn observations 
I 8 9-6 61:52 
IT 11 16-5 73-26 


You are to decide if the two populations can be taken to have the same variance. 
Sol. Let x and y be the observations for two samples. 


S 1 
Now X(x - x)? = Xx? ~N,x* = (x)? arr (Ex)? 
1 
= 61-52— , (9-6)? = 50 


L(y-jpy? = 73-26 ~ = (16'5)* = 48:51 


»_ 50 »_ 48-51 
Bh a an ag 
SP 500 
Oe ss aes 


Nos. of df are 8 -1=7 and 11-—1=10 
Fo. 05 = 3:14 > Fal 
Variances of two populations may be same. 


EXERCISES 


Two independent samples of 8 and 7 items respectively had the following values of 
the variable (weight in ounces) 

Sample I: 9 11 13 1] 15 9 12 14 
Sample II: 10 12 10 14 9 8 10 


Test whether the estimates of the popalation variances differ significantly. (Given that 
Foos for 7 and 6 df is 4-21). 
[Ans. Not significant] 


6727 


10. 


11. 


MATHEMATICAL STATISTICS 


. In two groups of ten children each, increase in weight due to two different diets in the 


same period were in pounds. 

8, 5, 7, 8, 3, 2, 7, rae : 

3, 7, 5, 6, 5, 4, 4, 5, 3, 
Test whether the variances differ significantly. Ge iven that Fo.os for 9 and 9 df is 
3-18). [Ans. Not significant] 


. Two random samples drawn from two normal populations are : 


Sample I: 20, 16, 26, 27, 23, 22, 18, 24, 25 and 19 
Sample II: 27, 33, 42, 35, 32, 34, 38, 28, 41, 43, 30and37 
Test whether the two populations have the same variance. [Ans. 2-14 Not significant] 


. The students of the same age of two different colleges were tested for variability of 


intelligence. The /.Q’s of 10 students from one college showed a variance of 20 and 
those of an equal number from the other college had a variance of 15. Discuss whether 
there is any significant difference in vaiability. [Ans. Not significant] 


. In one sample of 8 observations the sum of the squares of the deviations of the sample 


values from the sample mean was 84-4 and in other sample of 10 observations it was 
102-6. Test whether this difference is significant at 5% level, given that Fo.95 for 7 and 
9 degrees of freedom is 3-29. [Ans. Not significant] 


. Two samples of sizes 9 and 8 give the sum of squares of deviations from their respective 


means equal to 160 inches squares and 91 inches squares respectively. Can they be 
regarded as drawn from the same normal population? [Ans. 1-54, Not significant] 


- Two random samples gave the following results : 
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“The multiple correlation c: 
Assuming this hypothesis, t 


Fe=.:- 


follows F-distribution with k, n- 
17.5.3. Test for the Significan 


Let (xj, yy), @= 1,2. ALJ 
population and let 


N= 


i 


Let n be the correlation rati 
The hypothesis to be tested 
“Population correlation rati 


F=- 


follows F-distribution with h — | 
17.5.4. Testing the Linearity a 


Size Mean S.D. Let n be the correlation rati 
10 3-0 29 arranged in / arrays, from a biv: 
12 4-0 3-2 


Test whether the samples come from the same normal population. 


. Two chemists A and B repeat a protein analysis 20 times. If X; and Y; are the values 


obtained by A and B respectively and if 
XX; = 196, LX? = 1928, LY;=205 and LY? = 2105 
Determine whether there is a significant difference in precision between the two sets 
of results, the precision being measured by the inverse of the variance. 
[Ans. Not significant] 


. The means of two random samples of sizes 9 and 7 respectively are 196-42 and 198-82. 


Their respective variances are 26-94 and 18-73. Can the samples be regarded as drawn 
from the same normal population. 
Two random samples gave the following results : 


[Ans. No] 


The test statistic for testing 


F=- 


which follows F-distribution wil 
17.6. Fisher’s z-Distribution 


Fisher’s variate is defined b 
Zen + 


where F follows F distribution \ 
: PF = ¢ 


Size Sample mean Sum of squares of deviations 
from the sample mean 
Sample I: 10 15 90 
Sample II : 12 14 108 Dist. of z is dP = 


Test whether the samples come from the same normal population. 
Show how you would use Snedecor F-test to decide whether the following two samples 
have been drawn from the same normal population : 


This distribution is called F 


Size mean: Sum of squares of 17.7. z-tests 
. deviations from mean Tests of significance based 
Sample I: 9 68 36 z-tables provide critical vah 
Sample HT: 10 69 42 


Significance is tested by cc 


17.5-2. Test for the Significance of an Observed Multiple Correlation Coefficient 

Consider a random sample of size n from a (k + 1) variate normal population. Let R be 
the multiple correlation co-efficient of a variate with k other variates. Hypothesis to be 
tested is 


certain level of significance (5% 
If Zo.05 > Zcai the ratio is insig 
Similarly for 1% level. 
Some z-tests are as below : 


Zt 
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Sa ieiaeei ince eee 


-due to two different diets in the 


en that Fo.os for 9 and 9 df is 
[Ans. Not significant] 
ations are : 
24, 25 and 19 
28, 41, 43, 30and37 
ce. [Ans. 2:14 Not significant] 
es were tested for variability of 
ge showed a variance of 20 and 
variance of 15. Discuss whether 
[Ans. Not significant] 
s of the deviations of the sample 
‘ample of 10 observations it was 
4 level, given that Fo.9s for 7 and 
[Ans. Not significant] 
f deviations from their respective 
juares respectively. Can they be 
.? [Ans. 1-54, Not significant] 


ial population. 
times. If X; and Y; are the values 


= 205 and ZY? = 2105 
n precision between the two sets 
tse of the variance. 

[Ans. Not significant] 
spectively are 196-42 and 198-82. 
the samples be regarded as drawn 

[Ans. No] 


Sum of squares of deviations 
from the sample mean 
90 
108 
yal population. 
vhether the following two samples 


Sum of squares of 
deviations from mean 
36 
42 


ple Correlation Coefficient 


riate normal population. Let R be. . 


other variates. Hypothesis to be 
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“The multiple correlation co-eff. in the population is zero” 
Assuming this hypothesis, the statistic 


R*  n-k=-1 
F=-!;: 


follows F-distribution with k,n -k-ldf. 
17.5.3. Test for the Significance of an Observed Sample Correlation Ratio 


Let.(x1,. V9); GF 1,202 J = Ty ose nj) be a random sample from a bivariate normal 
population and let 


h 
N= » nj 
i=l 


Let 1 be the correlation ratio of y on x. 
The hypothesis to be tested is : 
“Population correlation ratio is zero” Assuming this hypothesis, the statistic 
1? N-h 
j-— n 2 h-1 
follows F-distribution with h — 1 and N-h. df. 
17.5.4. Testing the Linearity of Regression 
Let n be the correlation ratio and r the correlation coefficient for a sample of size of N 
arranged in h arrays, from a bivariate normal population. 
The test statistic for testing the hypothesis of linearity of regression is 
1 - r> N-h 
l-n* h-2 
which follows F-distribution with h — 2 and N—h dfs. 
17.6. Fisher’s z-Distribution 
Fisher’s variate is defined by 


1 
| z= 9 log. F 
where F follows F distribution with v,, v2 4.fs. 
; F=e& 
vhs, M2: 
2v,2 ve A z 
Dist. of z is dP = dz,~0<z<o0o 


: Vy +2 
(2.2) (v,+y,e77) 2 


This distribution is called Fisher’s z-distribution. 


17.7. z-tests 

Tests of significance based on z-distribution are called z-tests. 

z-tables provide critical values of z for various values of v,, v2 at 5% or 1% levels. 

Significance is tested by comparing the calculated value of z with tabulated value at 
certain level of significance (5% or 1%). Rules of decision are : 

If 20.05 > Zcai the ratio is insignificant and if zo.95 < Zcai, the ratio is significant at 5% level. 
Similarly for 1% level. 

Some z-tests are as below : 
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ere 


17.7-1. Test for Equality of Variance 
Consider two independent random samples x), X9, ..... Xn, ANd Yj, V2, .--- Yny drawn from 
normal populations. The hypothesis to be tested is : 
‘The population variances are same’ 
Assuming this hypothesis, the statistic 


1 a _1 ve 
z = — log (log, 10) lo 
2 e ri 7 e 810 s? 


(2-3026) S? S? 
= A —logyy “b= 1-1513 logyg “Lh 
2 10 se 10 3 


where S," = x (x; -x)* and S}= et : (0; -~y)? 


Ny ~1/ 
follows z-distribution with v, = 1, — 1 and vy. =n) -1 a 


Ex. 17-25. Show how you would use student’ t-test and Fisher’ z-test to decide whether 
the two sets of observations 
17, 27, 18, 25, 27, 29, 27, 23, 17, 
16, 16, 20, 16, 20, 17, 15, 21 
indicate samples drawn from the same universe. 


Sol. Let x and y be the variables for two samples respectively. 
x: 17 27 18 25 27 29 27 23 17 ‘Total 
X=x-23: -6 4 -5 2 4 6 4 0 -6= 3 
xX: 36 16 25 4 16 36 16 0 36= 185 
y: 16 16 20 16 20 17 15 21 


Y=y-16: 0 0 4 0 an ae | 5 ie 
ae 0 0 16 0 16 1 1 25 = 59 
x asi y=l1 reals 
322 8 
Sex) = BXaXyY = sx? —) (zy 
ny 


185 -5 (9) = 184 


and L(y~yy? = B(Y-Y¥)? 
= ey?) ay? = 59- 5 (169) == 
184.» 303. 
AY 2 = —, S =, 
8’? 56 
Firstly the equality of population variances will be tested by applying z-test. 
Se 1288 
Now Zz = 1-1513 logio a = (1-1513) logig —— 303 


(1-1513) {logio 1288 — logy, 303) 
= (1-1513) {3-1099 — 2-4814} ~ 0-724 
Now 2005 for 8 and 7 df. = 0-6576 < Z¢ay 
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and Zoo; for 8 and7 df = 
At 5% level the varia: 
At 1% level, the two : 
Now (-test will be applied t 


Ss 


i = 


No. of df = 

fo.os = 

teal ° 

The difference betwe 
The two samples do r 


Ex. 17-26. (i) Give the test 
z-transformation. 


(ii) A correlation co-efficic 
transformation to find out if this 

Sol. (i) Let ‘r’ and ‘p’ be 
respectively and n the sample si: 

Fisher z-transformation is 


g¢=- 


For large values of ‘n’, ‘z 
where 


and variance 
n- 


For large values of n, 


is asymptotically standard norma 
by calculating ‘wv’ and using nor 
Thus if |u| > 1-96, the differe 
is significant at 1% and if |u| > 3 
Note. The symbol ‘z’ used 
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6 ee 


ny AN Vi, V25 +++ ng drawn from 


’ 


ame 


1 st 

og 
10 52 
2 


3 lo 


(vj -yY 


| Fisher ' z-test to decide whether 


pectively. 


29 27 23. «17 


Total 


6 4 0 -6= 3 


36 16 0 36= 185 
17 15 21 
1 —-1 5 = 13 
1 1 25 = 59 
141 
8 
“xy 
1 303 
—(169)=—— 
3 ) 8 
‘sted by applying z-test. 
1513)1 bee 
O810 303 
210 303) 
4} = 0-724 
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and 2.9) for 8 and 7 d.f =0-9614 > Z¢q 
At 5% level the variance ratio is significant and at 1% level not significant. 
At 1% level, the two population variances may be taken to be same. 
Now f-test will be applied to test the significance of the difference between the means. 


ny . 
y (%-%) + 3 0-5 184425 
8 


S = f=1 j= it = 
Ny +n, -2 15 
_ 1775 
120 
70 141 


a BEY oe 8 Be 
\| = i cle ae 1 ; T ~ 3-05 
se 120 ¥8 9 
No. ofd ff = 9+8 -2=15 
toos = 2:13 and to; = 2-95 
teal > to.01 and f0.05 
The difference between the means is significant. 
The two samples do not belong to the same universe. 
Ex. 17-26. (i) Give the test of significance of correlation co-effi cient hated on Fisher 


z-transformation. 


(ii) A correlation co-efficient of 0-7 is discovered in a sample of 28 pairs. Apply z- 
transformation to find out if this differs significantly (a) from 0; (b) from 0-5. 

Sol. (i) Let ‘r’ and ‘p’ be the correlation co-efficient for sample and the population 
respectively and n the sample size 

Fisher z-transformation is 


1 
jig ais sions 
l-r 1- 


2 
For large values of ‘n’, ‘z’ is distributed asymptotically normally about the mean ‘§’ 


where 
1 l+p 1+p 
= — log, —— =1-1513lo ——— 
E 2 Be 1 810 l-p 


: 1 
and variance ——— 
n-3 


For large values of n, the variate 
uae G 
1 
n-3 


is asymptotically standard normal variate. Thus the significance between ‘7’ and ‘p’ is tested 
by calculating ‘uv’ and using normal tables. 

Thus if |u| > 1-96, the difference is significant at 5% level and if |u| > 2-58, the difference 
is significant at 1% and if |u| > 3, the difference is highly significant. 

Note. The symbol ‘z’ used here is different from Fisher’s z-distribution. 
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(ii) (a) n= 28, p=0, r=0-7 i 
e § = 0, z=(1-1513) logio 93 
= (1-1513) {logio 17 — login 3} 
= (1-1513) {1-2304 — 0-4771} = 0-87 
u = (0-87) J25 =4-35 >3. 


The hypothesis of zero correlation is refuted and hence the population is correlated. 
(5) n= 28, p=0-5, r=0-7 


§ 


1-5 
(11513) logio 5g = (1-1513) logyo 3 


= (1-1513) (0-4771) = 0-55 
Also z = 0-87 
: (0-32) /25 = 1-60 < 1-96, 
The difference between r and P is not significant and hence the hypothesis that 
p = 0-5 is acceptable. 
Ex. 17-27. What is the probability that a correlation co-efficient of 0-75 or less can 
arise in a sample of 30 from a normal population in which the true correlation is 0-9 ? 
Sol. r = 0:75, p=0-9, n =30 


u 


E = (1-1513) logig aa = 1-1513 logy 19 
(1-1513) (1-2788) ~ 1-472 


I 


1-75 
(1-1513) logy (<2) = (1-1513) log, 7 


Z = 
= (1-1513) (0-8451) ~ 0-973 
u = (0-973 ~ 1-472) 27 ~~2-59 
Now  P {r<0-75} = P{l+rsi7s}=piit7 <1 _ 
ip 0295 


P{z < 0-973} = P{u <— 2-59} 
0-5 — P{- 2:59 <u <0} =0-5-P{0<u< 2°59} 
_ = 0:5 -0-4952 = 0-0048, 

Ex. 17-28. /n a random sample of 19 pairs of values from a bivariate normal population, 
the correlation was found to be 0-7. Is this value consistent with the assumption that the 
correlation in the population is 0-5 ? (Ans. u = 1-28] 

Ex. 17-29. (a) Give the procedure of testing the significance of the difference between 
two independent correlation co-efficients. 

(b) The correlation coefficient between temperature of rice and breakage percentage 
calculated from two samples of 12 and 16 are 0-8912 and 0-8482 respectively. De *he two 
estimates differ significantly ? 

Sol. (a) Let there be two independent random samples of sizes N\, Nz and correlation 
co-efficients r;, r2. The hypothesis to be tested is : 


“Can the samples be regarded as drawn from the same population or from two 
populations with the same correlation co-efficients’. 


Assuming this hypothesis, the statistic 


I 


21 —22 


1 1 
+ 
ny —3 Uo) -3 


u= 
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where z= ( 


is asymptotically standard norma 
Thus the significance of the , 
using normal tables. 


(c) : mo. 

Zo (1 

é o ( 
= (1 

= ds 

20> (1 

(1- 

(1- 

1 

“=— 


.. Difference between r| al 
Ex. 17-30. The first of two sai 
while the second of 28 pairs has ac 


Sol. ny = 23, 
Zz = (1:1 

= (1 

Fae 

= (1 

—— 

1 


Difference is not signific: 

Ex. 17-31. The correlation co 

aptitude for a group of 20 girls is 0- 
significant? — 
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3} 
} = 0-87 


ence the population is correlated. 


313) logio 3 


it and hence the hypothesis that 


co-efficient of 0-75 or less can 
the true correlation is 0-9 ? 


13 logig 19 
2 


11-1513) log,g 7 


£ 1-75 =7 
0-25 


9} 
5 — P{0 <u <2-59} 


na bivariate normal population, 
nt with the assumption that the 


(Ans. «= 1-28] 
‘cance of the difference between 


frice and breakage percentage 
):8482 respectively. De +h two 


s of sizes n,, nz and correlation 


same population or from two 
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where z= (1-1513) logo = (i= 1,2) 
wh 
is asymptotically standard normal variate. 


Thus the significance of the difference between r1 and rp is tested by calculating u and 
using normal tables. 


(c) : ny = 12, ny = 16, r, =0-8912, rp = 0:8482 


z, = (1-1513) {los oat 


0-1088 


= (1:1513) {logio 18912 - logiy 1088} 
(1-1513) {4:2767 — 3-0366} = (1-1513) (1-2401) 


= 1-428 
1.8482 
1-:1513) 41 
( ) {los aa 
(11513) {logig 18482 — logo 1518} 
(11513) {4-2667 — 3-1813} = (1-1513) (1-0854) ~ 1-250, 
1-428—1-250 


== = 0-41 < 1-96. 
1. 1 
—_——~— + 
12-3 16-3 


’ 


Difference between r; and r, is not significant. , 
Ex. 17-30. The first of two samples consists of 23 pairs and gives a correlation of 0-5 
while the second of 28 pairs has a correlation of 08. Are these values significantly different ? 
Sol. ny = 23, m=28, r,=0-5 and r2 = 0-8 


ll 


22 


ul 


Zz 


= (1-1513) {tos a = (1-1513) logyo 3 
= (1-1513) (0-4771) = 0-5493 


Z = (11513) logio cs = 1-1513 logio 9 


= (1-1513) (0-9542) = 1-0986 
1-0986—0-5493 
lu| = Seg = 1-83 < 1-96 
—_— + — 
20 25 
Difference is not significant. ; 

Ex. 17-31. The correlation co-efficient between Mathematics aptitude and Physics 
aptitude for a group of 20 girls is 0-42 and for a group of 25 boys is 0-75. Is the difference 
significant ? [Ans. 1-6, Not significant] 
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LOGARITHMS 


Table 1 
LOGARITHMS 


fe 


E 
| 
4 


123] 4 5 6f7 8 9 
17 21 26 


0128} 0170 5 9.93 30 34 38 
02121} 0253] 0294) 0334] 0374] 4 8 12 16 20 24] 28 32 36 
0492 | 0531} 0569 4 8 12] 16 20 23] 27 31 35 
0607! 0645] 0682} 0719] 0755} 4 7 41] 15 18 22 | 26 29 33 
0899 | 0934 3 7 11] 14 18 21125 28 32 
0969} 1004] 1038] 1072 3 7 10] 14 17 20] 24 27 31 
13] 1139) 1173] 1206] 1239] 1271 3 6 10] 13 16 19] 23 26 29 
1303} 1335] 1367] 1399] 1430] 3 7 10] 13 16 19 | 22 25 29 
1492] 1523 | 1553] 1584 : 3.6 9] 12 15 19] 22 25 29 
1644] 1673] 1703) 1732] 3 6 9] 12 14 17] 20 23 26 
15] 1761} 1790] 1818] 1847] 1875 3.6 9] 11 14 17] 20 23 26 
1903] 1931] 1959] 1987] 2014] 3 6 8] 41 14 17] 19 22 25 
2041} 2068} 2095} 2122] 2148 3 6 8] 11 14 16] 19 22 24 
2175} 2201f 2227) 2253} 2279] 3 5 8| 10 13 16} 18 21 23 
17} 2304] 2330] 2355] 2380] 2405 3.5 8] 10 13 15] 18 20 23 
2430] 2455] 2480] 2504} 2329] 3 5 81| 10 12 15 | 17 20 22 
2553] 2577] 2601] 2625] 2648 25 7) 9 12 14/17 19 21 
2672] 2695] 2718} 2742} 2765 2 4 7] 9 11 14] 16 18 21 
2788] 2810] 2833] 2856] 2878 2 7] 9 1b 13-[14 18 20 
2900} 2923] 29451 2967] 2989] 2 6] 8 11 13] 15 17 19 

20 13 


21 12 
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3 Table II 
LOGARITHMS 


2] 2 
2] 2 
2) 2 
2] 2 
2) 2 
2) 2 
2h. 2 
2) 2 
2) 2 
24) 2 
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Table 111 


ANTILOGARITHMS 


2 4 
2 4 
2 4 
2 4 
2 4 
2 4 
2 4 
2 4 
2 4 
2 4 
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Table 1V 
ANTILOGARITHMS 


2-3026 | . 4.6052 


HYPERBOLIC OR NAPERIAN LOGARITHMS 


eg Ry Table V 


HYPERBOLIC OR NAPERIAN LOGARITHMS 


789 
29; 38 67 76 86 

26] 35 61 78 

32 56 72 

30 52 67 

28 48 62 


26 45 58 
24 42 55 
24 40 

22 38 49 
20 36 46 


34 44 
33 42 
31 40 
30 38 
29 37 


27 35 
26 34 
25 33 
25 32 
24 31 


23 30 
22 29 
22 28 
21 27 

26 
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23 
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Table VI 
HYPERBOLIC OR NAPERIAN LOGARITHMS EXPONENTL: 
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EXPONENTIAL AND HYPERBOLIC FUNCTIONS 


Table VII 
EXPONENTIAL AND HYPERBOLIC FUNCTIONS 


cosh 


SARITHMS 


ors aaa 


17-8819 | 19-5793 | 21-2767 


; 1 i 
cosh x= ste +e*), sinh x= 5 (e‘-e*) 
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Table VIII 
POWERS, ROOTS AND RECIPROCALS . 


132651 
140608 
148877 
157464 
166375 


175616 
185193 
195112 
205379 
216000 


226981 
238328 
250047 
262144 
274625 


287496 
300763 
314432 
328509 
343000 


357911 
373248 
389017 


405224 
421825 


438976 
i 456533 
474552 
493039 
512000 


531441 
551368 
571787 
592704 
614125 


636036 
658503 
681472 
704969 
729000 


753571 
778688 
804357 
830584 
857375 


884736 
912673 
941192 
970299 
1000000 


117649 
125000 


POWERS, ROOTS AND RECIPROCALS 


Table IX 
POWERS, ROOTS AND RECIPROCALS 


132651 
140608 
148877 
157464 
166375 


175616 
185193 
195112 
205379 
216000 


226981 
238328 
250047 
262144 
274625 


287496 
300763 
314432 
328509 
343000 


357911 
373248 
389017 
405224 
421825 


438976 
456533 
474552 
493039 
512000 


531441 
551368 
.571787 
592704 
614125 


636036 
658503 
681472 
704969 
729000 


753571 
778688 
804357 
830584 
857375 


884736 
912673 
941192 
970299 
1000000 


AREAS UNDER STANDARD NORMAL CURVE 


fu | 0-00 0-01 


0000 = -0040 
0398 = -0438 
0793-0832 
1179-1217 
1554-1591 


1915-1950 
‘22575-2291 


0-02 


-0080 
0478 
-0871 

1255 
1628 


1985 
2324 


‘2580 -26115 -2642 
2881  -2910 
3159, -3186 


3413-3438 
3643-3665 
3849 = --3869 
4032  -4049 
4192 = -4207 


4332-4345 
4452-4463 
4554-4564 
‘4641-4649 
‘AT13. -4719 


‘4772-4778 
4821 -4826 


2939 
3212 


3461 
3686 
3888 
4066 
4222 


4357 
4474 
4573 
4656 
4726 


4783 
-4830 


‘4861 48645-4868 


4893-4896 
‘4918  -4920 


-4938  -4940 
-4953 -4955 
4965 » -4966 
‘4974 = -4975 
4981 = -4982 


49865 -4987 


-4990 = -4991 
4993-4993 
4995-4995 
‘4997 — -4997 


4998  -4998 
-4998  -4998 
‘4999 = -4999 
‘4999 ~~ -4999 


-49995 -49995 


4898 
4922 


4941 
4956 
-4967 
-4976 
-49825 


4987 
4991 
-4994 
4995 
4997 


-4998 
-4999 
-4999 
“4999. 
5000 


0-03 


Table X 
0-04 0-05 


0199 
0596 
0987 
1368 
-1736 


-2088 
2422 
2734 
3023 
-3289 


3531 
3749 
3944 
“4115 
4265 


4394 
-4505 
4599 
-4678 
4744 


4798 
4842 
4878 
-4906 
-4929 


-4946 
-4960 
-4970 
‘4978 
4984 


4989 
4992 
-4994 
-4996 
-4997 


4998 
-4999 
-4999 
-4999 
_ 5000 


(x) 


0-06 


0239 
0636 
1026 
1406 
1772 


2123 
2454 
2764 
3051 
3315 


3554 
3770 
3962 
4131 
4279 


-4406 
4515 
-4608 
4686 
-4750 


4803 
4846 
4881 
-4909 
4931 


4948 
‘4961 
497) 
-4979 
4985 


-4989 
4992 
4994 
-4996 
-4997 


4998 
-4999 
-4999 
-4999 
-5000 


0-07 


0279 
0675 

-1064 
1443 
1808 


2157 
2486 
2794 
3078 
3340 


3577 
3790 
3980 
‘4147 
4292 


4418 
4525 
4616 
4693 
4756 


4808 
4850 
4884 
4911 
4932 


4949 
-4962 
-4972 


0-08 


0319 
0714 
1103 
1480 
1844 


-2190 
2518 
-2823 
3106 
3365 


“3599 
3810 
3997 
4162 
4306 


4429 
4535 
4625 
-4699 
‘4761 


4812 
4854 
4887 
4913 
4934 


4951 
4963 
4973 


“49795 -4980 


4985 


4989 
4992 
-4995 
-4996 
4997 


4998 
-4999 
-4999 


49995 - 


-5000 


4986 


-4990 
-4993 
-4995 
4996 


0-09 


0359 
07535 
1141 
1517 
1879 


-2224 
-2549 
2852 
3133 
‘3389 


3621 
3830 
‘4015 
4177 
4319 


444) 
“4545 


*-4633 


-4706 
4767 


4817 
4857 
4890 
4916 
-4936 


4952 
4964 
4974 
4981 
4986 


-4990 
4993 
4995 
4997 


49975-4998 


-4998 
-4999 
-4999 


5000 


4998 
-4999 
-4999 


49995 -49995 


5000 


AREAS UNDER STANDARD NORMAL CURVE 


Table XI 


2 0-00 0-01 0-02 0-03 0-04 0-05 0-06 0:07 0-08 0-09 


re a 


SIGNIFICANCE POINTS OF w? 


Table XII 
POBABILITY 
99 95 90 -70 50 :30 10 -05 01 
| 08157 -00393 0158) = 143-455 1-074 2-706 3-841 6-635 


0201 103 211 ‘713 1-386 2-408 4-605 5-991 9-210 
“115 352 584 1-424 2.366 3-665 6-251 7-815 11-345 
297 ‘T11 1-064 2-195 3.357 4-878 7-779 9-488 13-277 
554 1-145 1610 3-000 4-351 6-064 9:236 11-070 15-086 


872 1-635 2-204 3-828 5-348 7-231 10-645 12-592 16-812 
1-239 2-167 2-833 4-671 6-346 8-383 12-017 14-067 18-475 
1-646 . 2-733 3-490 5-527 7-344 9-524 13-362 15-507 20-090 
2-088 3-325 4-168 6-393 8343 10-656 14-684 16-919 21-666 
2:558 3-940 4865 7-267 9-342 11-781 15-987 18-307 23-209 


3-053 4-575 5578 8-148 10-341 12-899 17-275 19-675 24-725 
3-571 5-226 6-304 9-034 11-340 14-011 18549 21-026 26-217 
4-107 5-892 7-042 9-926 12340 15-119 19-812 22-362 27-698 
4-660 6-571 7-790 10-821 13-339 16-222 21-064 23-685 29-141 
5-229 7-261 8-547 11-721. 14-339 17-322 22-307 24-996 30-578 


5-812 7-962 9-312 12-624 15-338 18-418 23-542 26-296 32-000 
6-408 8-672 10-085 13-531 16-338 19-511 24-769 = 27-587 33-409 
7-015 9-390 10-865 14-440 17-338 20-601 25-989 28-869 34-805 
7-633 10-117 11-651 15-352 18-338 =. 21-689 27-204» 30-144 36-191 
8-260 10-851 12-443 16-266 19-337 22-775 28-412 31-410. 37-566 


8-897 11-591 13-240 17-182 20-337 23-853. 29-615 32-671 38-932 
9542 12338 14-041 18-101 21-337 24-939 30-813 33-924 40-289 
10196 13-091 14-848 19-021 22-337 26-018 32-007 35-172 41-638 
10-856 13-848 ~—-15-659 19-943 23-337. 27-096 33-196 36-415 42-980 
11-524 «14-611 16-473. 20-867 24-337 28-172 34-382 © 37-652 44-314 


12198 15-379 17-292 21-792 25-336 29-246 35-563 38-885 45-642 : f 
12-879 16-151 18-114 22-719 26-336 30-319 36-741 40-113 46-963 
13-565° 16-928 18-939 23-647 27-336 31-391 37-916 41-337 48-278 
14-256 17-708 19-768 24-577 28-336 32-461 39-087 42-557 49-588 
14-953 18-493 20-599 25-508 29-336 33-530 40-256 43-773 50-892 


Table XIII 
POBABILITY 


6-314 
2-920 
2-353 
2-132 
2:015 


1-943 
1-895 
1-860 
1-833 
1-812 


1-796 
1-782 
1-771 
1-761 
1-753 


1-746 
1-740 
1-734 
1-729 
1-745 


1-721 
1-717 
1-714 
1-711 
1-708 


1-706 
1-703 
1-701 
1-699 
1-697 


1-684 
1-671 
1-658 
1-645 


VALUES OF ‘nod ¢’ 


12-706 
4-303 
3-182 
2-776 
2-571 


2-447 
2-365 
2306 
2-262 
2-228 


2-201 
2-179 
2-160 
2-145 
2-131 


2-120 
2-110 
2-101 
2-093 
2-086 


2-080 
2-074 
2-069 
2-064 
2-060 


2-056 
2-052 
2-048 
2-045 
2-042 


2-021 
2-000 
1-980 
1-960 


63-657 
9-925 
5-841 
4-604 
4-032 


3-707 
3-499 
3-355 
3-250 

3-169 


3-106 
3-055 
3-012 
2-977 

2-947 


2-921 
2-898 
2-878 
2-861 
2-845 


2-831 
2-819 
2-807 
2-797 
2-787 


2-779 
2-771 
2-763 
2-756 
2:750 


2-704 
2-660 
2-617 
2:576 


5% POINTS OF F 
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1% POINTS OF F 
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5% POINTS OF Z 


Table XVI 
5% POINTS OF Z 


26479 2-6870 2-7071 2-7194 2-7276 2:7380 2-7484 2-7588 2-7693 
14722 1-4765  1-4787 1-4800 1-4808 1:4819 1-4830 1-4840 1.4851 
11284 1-1137  1-1051 1-0994 1-0953 1-0899 1-0842 1-0781 1-0716 
9690 -9429 -9272 -9168 -9093 -8993 -8885 -8767 -8639 
8777 — 8441 8236-8097 -7997 _-7862 «--7714_~—-7550_—-7368 


41535 
2:2950 2-2976 2-298¢ 
1-7649 1-7140 1-691 
15270 1-4452 1.4075 
13973 1:2929 1-244¢ 
‘8188 -7798  -7558 -7394 -7274 -7112 -6931 -6729 -6499 
‘171717 —--7347,—-7080-S 6896) 6761-6576 «= 6369, -6134 -5862 
‘7475. -7014.— 6725-6525 6378 «6175-5945 5682-5371 
-7242 -6757 -6450 -6238 +6080 -5862 -5613 +5324 -4979 
‘7058 6553. -6232-— 6009 = -5843. 5611-5346 = 5035-4657 


13103 1-:1955 1-140} 
12526 1-1281 1-0672 
1-2106 1-0787 1.0135 
11786 1-0411  -9724 
11535 1:0114 — -9395 
6909 -6387 +6055 -5822 5648-5406 -5126 ©=-4795_— -4387 

6786 +6250 -5907 -5666 -5487 -5234 -4941 +4592 -4156 

6682-6134 +5783. 5535-5350. 5089 «-4785 4419 = +3957 
6594-6036 5677. —- 5423. 5233-4964 -4649 = -4269 = -3782 
6518 +5950 -5585 -5326 +5131 4855-4532) -4138 = 3628 


11333-9874 = _-9136 
11166 -9677  -8919 
1:1027  -9511 — -8737 
10909-9370 -8581 
10807 -9249  -8448 
6451-5876 5505 = 5241-5042 4760-4428 «= 4022. 3490 | 
6393-5811 -5434  -5166 »= 4964-4676 «4337 = 3919-3366 | 
6341 +5753. 5371 5099-4894. -4602 4255-3827) -3253 

6295 5701-5315 5040-4832: -4535—— «4182 = +3743 «3151 

6254 -5654 -5265 -4986 -4776 -4474 -4116 -3668 -3057 


10719 -9144 — -8331 
10641 +9051  -8229 
10572-8970 —--8138 
10511-8897 -8057 
10457-8831 = -7985 
6216 +5612 5219-4938 = 4725-4420 --4055) 3599-2971 
6182-5574. «5178 = -4894. = 4679-4870 «4001 = -3536 = -2892 
6151  -5540 5140-4854 4636 «4325 «3950 = 3478-2818 
6123-5508 «5106 = -4817 4598 «= 4283. 3904-3425. -2749 
6097-5478 = -5074 = --4783.— 4562 »«-4244: «3862 = 3376 = -2635 


10408 -8772  -7920 
1-0363 +8719 = -7860 
23 | 10322 -8670 +7806 
24 11-0285 +8626 = _-7757 
25 | 1-0251 -8585 = +7712 


6073. 5451 = 5045-4752. 4529-4209 -3823 3330 = -2625 
6051-5427 -5017_—--4723— 4499-4176 «3786 = 3287-2569 
6030 -5403 -4992 -4696 -447/1 -4146 -3752 -3248 -2516 
“6011 -5382 -4969 -4671 -4444 -4117 -3720 -3211 -2466 
5994-5362 4947-4648 «= 4420-4090 3691 3176-2419 


26 | 1:0220 -8548 = -7670 
27 {10191 8513-7631 
28 | 10164 -8481 = -7595 
29 | 1-0139 +8451 = -7562 
30 | 10116 -8423 -7531 
5866 -5217  -4789 -4479 4242-3897) 3475-2920) -2057 
5738 -5073.- -4632,—s -4311 = -4064 «+3702 «3255-2654 1644 
5611 -4930 -4475 -4143 3885-3506 = -3032)= -2376—-1131 
5486 -4787 -4319-3974. 3706 «3309-2804 = -2085 0 


40 | -9949: -8223  -7307 
60 | -9784 -8025 -7086 
120 | 9622 -7829 -6867 
9462 


(xvi) 


2:7380 2:7484 2:7588 2-7693 
1-4819 1-4830 
10899 1-0842 


8993 
7862 


‘7112 
6576 
6175 
5862 
‘5611 


5406 
5234 
5089 
4964 
4855 


4760 
4676 
4602 
4535 
4474 


4420 
‘4870 
“4325 
4283 
4244 


4209 
4176 
4146 
‘4117 
-4090 


3897 
3702 
3506 
3309 


8885 
‘7714 


6931 
6369 ,. 
5945 
5613 
5346 


5126 
4941 
4785 
4649 
4532 


4428 
4337 
4255 
4182 
A116 


-4055 
-4001 
3950 
3904 
3862 


3823 
3786 
3752 
3720 
3691 


3475 
3255 
3032 
2804 


1-4840 1-4851 
1-0781 1-0716 


8767 
-7550 


6729 
6134 
5682 
5324 
5035 


‘4795 
-4592 
-4419 
-4269 
4138 


-4022 
“3919 
-3827 
3743 
-3668 


“3599 
-3536 
-3478 
3425 
3376 


-3330 
3287 
3248 
3211 
3176 


2920 
2654 
2376 
2085 


-8639 
-7368 


6499 
5862 
5371 
-4979 
4657 


4387 
4156 
3957 
3782 
3628 


-3490 
-3366 
-3253 
3151 
3057 


2971 
2892 
2818 
2749 
2635 


2625 
2569 
2516 
2466 
2419 


-2057 
+1644 
“1131 

0 


4-1535 


1-3973 


1-3103 


1-2106 
1-1786 
1-1535 


1-1333 
1-1166 
1-1027 
1-0909 
1-0807 


10719 
1-0641 
1.0572 
1-0511 
1-0457 


1-0408 
1-0363 
1-0322 
1-0285 
1-0251 


1-0220 
1-019] 
1-0164 
1-0139 
1-0116 


9949 
-9784 
9622 
9462 


2-2950 
1-7649 
1-5270 


1-2526 


4-2585 
2:2976 
1-7140 
1-4452 
1-2929 


1-1955 
1-1281 
1-0787 
10411 
1-0114 


9874 
9677 
9511 
9370 
9249 


9144 
9051 
8970 
8897 
8831 


8772 
8719 
8670 
8626 
8585 


8548 
8513 
8481 
8451 
8423 


8223 
8025 
-7829 
-7636 


4-2974 
2:2984 
1-6915 
1-4075 
1.2449 


11401 
1-0672 
1-0135 
9724 
9399 


9136 
8919 
8737 
‘8581 
8448 


8331 
8229 
8138 
8057 
7985 


-7920 
-7860 
-7806 
‘7757 
‘7712 


‘7670 
‘7631 
-7595 
-7562 
7531 


‘7307 
-7086 
-6867 
6651 


1% POINTS OF Z 


Table XVII 
1% POINTS OF z 


4-3175 
2-2988 
1-6786 
1-3856 
1-2164 


4-3297 
2-299 1 
1-6703 
13711 
1-1974 


4:3379 43482 4.3585 
2:2992 2:2994 2.2997 
1-6645 1:6569 1-6489 
1:3609 1-3473 1-3327 
1-1838 1-1656 1-1457 


4:3689 4.3794 
2-2999 2-3001 
1-6404 1-6314 
1-3170 1-3000 
11239 1-0997 


1:1068 1-0843 1-0680 1-0460 1:0218 -9948 .9643 
1-0300 1:0048 -9864 -9614 -9335 -9020 -8658 
‘9734-9459 9259-8983 -8673 8319-7904 
‘9299-9006 8791-8494 -8157 -7769 -7305 
8954 8646 8419-8104 -7744 —--7324 6816 


8674 «8354-8116 -7785 -7405 = 6958-6408 
8443-8111 = -7864--7520 --7122 6649-6061 
‘8248 =-7907— -7652--7295 6882-6386 +5761 
‘8082 -7732,—-7471_—«-7103- 6675 6159-5500 
‘7939-7582 --7314 6937-6496 5961-5269 


5064 | 


‘7814-7450 -7177--6791 6339-5786 

‘7705-7335. 7057 6663-6199 5630-4879 
‘7607-8232, 6950 6549-6075 5491-4712 
‘7521-7140 6854. 6447-5964 5366-4560 
‘7443-7058 = 6768 «6355-5864. — 5253-442] 


‘7372-6984 6690-6272. --5773— «5150-4294 
‘7309 6916-6620 6196-5691 -5056_-4176 
‘7251-6855 + -6555 6127 +5615 -4969 -4068 
‘1197-6799 6496-6064 +5545 = -4890 3967 
‘7148-6747 6442 6006-5481 = 4816-3872 
‘7103, 6699-6392 5952-5422. -4748 «—--3784 
‘7062-6655 = -6346 «5902-5367. -4685_—--3701 
‘7023, 6614 = 6303-5856 5316 = -4626-—--3624 
‘6987-6576 = 6263-5813 +5269 ~=—--4570-—--3550 
6954-6540 6226 5773-5224. 4519-3481 
6712 6283. 5956-5481 -4901 —-4138 = -2952 
6472 6028 = 5687 5189-4574 «3746-2352 
6234-5774 5419-4897 -4243— 3339-1612 
‘5999-5522, +5152. 4604-3908 ~~ -2913 0 


(xvi) 


Table X 


Areas under the Standard Normal Curve 


The area is measured from the mean ‘0’ to any ordinate ‘U’. . Significant values 1, of t for 


P : 
F 
The table gives the shaded area. 


P,, is the shaded area. 


The results are given for values of ‘U’ at intervals 0-01. 


Table XI 


Ordinates of the Standard Normal Curve | 
The table gives ordinates (y) erected at a distance ‘x’ from the mean i.e., v, is the number of degrees 
the smaller. 


Nl 


1 
= >< e 
2 V2n 
v, is the number of degrees 
the smaller. 


The results are given for values of ‘x’ at intervals 0-01. 


Table XII 


Significance Points of y? 7 


‘The table gives the values of y Be for different 
p’s and degrees of freedom ‘n’. 


p 


Shaded area = p 


For large values of n, the expression y2w? ~ ,/2n~| may be used as a normal variate 


with unit variance, remembering that the probability of y? corresponds with that of a single 
tail of the normal. 


(xviii) 


Curve 
om the mean i.e., i 


2 
ap 


Shaded area = p 


nay be used as a normal variate 


orresponds with that of a single 


Table XTiI 
Values of ‘mod. f° 


Significant values ¢, of ¢ for given probabilities P,. and d.f'v. where 


P, = P((t|>t) 


P,,is the shaded area. 


Table XIV 


Variance Ratio 
5% points of F 


v, is the number of degrees of freedom for the greater estimate of variance and v, for 
the smaller. 
Table XV 


Variance Ratio 
1% points of F 


v, is the number of degrees of freedom for the greater estimate of variance and v, for 
the smaller. 


QO 


(xix) 


A 


Absolute moments 69, 79 
Additive law of probability 203 
Arithmetic mean 4 
Ascending differences 114 
Association of attributes 102 
Asymmetrical 4 
Asymptotically unbiased estimate 566 
Attribute 91 
B 
Bayes theorem 250 
Bessel’s formula 139 
Beta distribution 487 
Binomial 337 
Bivariate distribution 425 
Bivariate normal distribution 463 
Bowley coefficient of skewness 81 
ee. © 
Cauchy distribution 494 
Cells 606 
Central differences 115 
difference formulae 139 
tendency 4 
Chance variate 268 
Characteristic function 294, 496 
Class | 
Class limits 1 
Coefficient of colligation 102 
contingency 625 
dispersion 5 
mean deviation 46 
skewness 81 
variation 50 
Compound probability theorem 206 
Conditional density function 433 
expectation 445 
probability function 425 
variance 446 
Confidence belt 582 
interval 566 
limits 582 
Consistency of attributes 91 
Contingency table 620, 621 
Continuous variable 301 
Correlation coefficient 506 
ratio 54] 
Cumulants 290 
Cumulative function 290 


Index 


Curve fitting 172- 
D 
Degrees of freedom 606 
Density function 273 
Descending differences 114 
Difference Operators 111 
Dispersion 44 
Distribution function 301 
Divided differences 111 
Double exponential distribution 492 
= 
Events 185 
Existent population 564 
Expectation 280 
Expected value 268 
Exponential distribution 490 
F 
Factorial 115 
Factorial moments 69 
Fiducial limits 582 
Finite population 564 
Frequency curve 3 _ 
distribution 1 
polygon 2 
G 
_Gamma distribution 484 
Generatic function 296 
Geometric mean 12 
Guass backward formula 139 
Guass forward formula 139 
H 
Harmonic mean 12 
Histogram 2 
Hypergeometric distribution 418 
Hypothesis 566 
Hypothetical population 564 
I 
Independent events 206 
Indicator function 272, 301 
Interpolation 111 
Interpolation formulae 127 
Interquartile range 44 
J 
Joint cumulative distribution 
function 426, 433 
Joint density function 426 ° 
Joint moment generating function 444 


K 
Kurtosis 86 

L 
Lagrange’s formulae 128 
Large sample 566 
Laws of Expectation 280 
Leptoburtic 86 
Level of significance 566 
Limitations (ii) 
Linear constraints 606 
Linear regression 506 
Lines of regression 506 
Lognormal distribution 493 

M 
Marginal cumulative distribution 

function 427 

Marginal density function 426, 4 
Marginal probability function 4: 
Mathematical expectation 268 
Mean 7 
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